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le PRELIMINARIES 


A. INTRODUCTION 

In todays world of rapidily advancing technology, we 
rely more and more on high speed computers, electronic 
communication equipment, etc.. Data streams are handled in 
these devices as a series of electronic pulses and non- 
pulses. Numerically, we view such data as a sequence of l's 
maemo S, associating a | with each pulse and a 9 with e@acn 
fon—-pulse. 

With this technology, there arises Aa need to generate 
random-like sequences of l's and W's. Such sequences are 
useful in the areas of coding and communication as well as 
in mathematical modeling. One of the simplest and most 
efficient ways of generating such a sequence is with a 
binary shift register. 

mae CUtpuct Cf these shift registers give rise to 3 
Bieecial Graph called a de Bruijn granohn. Knowledge about the 
eeeeuctire Of the de Bruijn graph has proven useful in 
@Bemerating and analyzing the corresponding sequences of L's 
and O's. 

Extending the existing theory of de Bruijn graphs, we 
define a cover of the de Bruijn graph and examine properties 
of such covers. Our main emphasis is on tne cardinality of 


these covers. Combinatorially, we are able to place upner 


and lower bounds on the cardinality of these covers. We then 
search for methods to form covers that approach these 


bounds. 


B. BINARY SHIFT REGISTERS 

A binary shift register Of span nh IS a collection me am 
storage devices X),x5,...,x, each capable of holding either 
a © Or al tand 4 Euncteion E(x, X95, e060 0X), taking on values 
Of JY or 1, computed from the contents of the storage 
devices. At the pulse of an external clock, the contents of 


register x. 


i+] 1S shifted to the register Xe for 1 = 


2,---, nm=-l. At the same time, the value of E(x) X5,--2 0X4) 
is fed into the register x,- The output of the shift 
register can be taken to be the Sequence of bits that 
appears in any of the n stages of the register or it can be 
taken to be the whole contents of the n stages. Displayed in 
Fig. I.l1 is a general n-stage shift register. The dependence 
on the clock has been suppressed. 

The contents of the shift register (regarded as either 
an integer or a binary n-tuple) is called its state. Since 
each stage can be either a 6 or al, there are pas possible 
states for a register of n stages. Define Vn as the space of 
these binary n-tuples. With every pulse of the clock, a 


transition is made from one state to the next. Thus, the 


feedback function induces a mapping F from ve to Vie 
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In this mapping, the n-tuple (X) 4 Xo,--+-%,) is mapped to 


(X54, Xge00 ee Xe EX X Qe 0+ Xy))- Since the value of the 
function f is either al ora YJ, the state (X),X5, eel, on) 
“ap 2 Xn-1 Xn 
| | | | ao | | | 
| | | | | | | | 
4+—| \<q—— —_ . | | <q— | | 

| | | | | | | 
We L || 
(£04 13694 ++ %q))———_> Xn) 


Fig. I.l1 The n-stage Shift Register with Feedback Function, 
EX) Xqe0+2 eX) 


has two possible successor states, (x 3) anda 


pres ee Xs 
5 +s X71) Vie call these states conjugate states. Note 
that conjugate states agree in all but the last bit. In the 
Same manner, the state (X),Xoe---0X)) has two possible 
predecessor states, (O,X),..-,X 1) and (1, xX ,,-+++X,_1)- We 
call these states companion states. Companion states agree 
in all bits except the first. Thus, each (n-l)-tuple forms 
an adjacenc uadruple as seen in Fig. 1.2, where a pair of 
companion states maps to a pair of conjugate states. The 
superposition of all gn-l adjacency quadruples determines a 
directed graph. This graph is called the de Bruijn graph B 


n’ 


named for the Dutch mathematician N.G. de Bruijn who studied 


a 


these graphs in reference 1. In Fig. I.3, we give the de 


Bruijn graph corresponding to n = 3 and 4. 
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Fig. 1.2 Adjacencey suadrupie 
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Fig. I.3 de Bruijn Graph for n = 3,4. 
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Cc. de BRUIJN GRAPH PROPERTIES 

In the de Bruijn graph, each possible state (viewed as 
n-tuples) of the shift register is represented by a node. 
Thus, there are 2” nodes in the de Bruijn graph of order n. 
Each possible transition from one state to another is 
represented by an arc connecting those two nodes. Since each 
state can map to one of two successor states, each node has 
two arcs emanating from it. These two arcs lead to conjugate 
nodes. Thus, there are gntt arcs in the de Bruijn graph of 
Seder nm. Lach node also has two arcs coming into it. These 
arcs come from companion nodes. Viewing these n-tuples as n 
bit binary numbers, we see that the two nodes that are 

n 

successors of a node x, where x = xy20 are the nodes 
2x (mod 2") and 2x + 1 (mod on) Whether ae node x 1s 
mapped to the node 2x (mod 2") or the node 2x + 1 (mod gs 
is determined by the feedback function. The two predecessors 
Me weEne node x are the nodes {|x/2!] and {x/2| + gn-l where 
Ey} is the greatest integer less than or equal to y. 
Likewise, the node that feeds into node x is determined by 
the feedback function. 


Paeoathn Of length k in 8B iseeon peer Ceri On 2Oot —nodes 


n 
Ny rNgee eee My yy Suiem thate an ane Gennecting n, to n34) Len 
i= 1,2,...,k exists in B.. The path is called simple if no 


meee appears On it more than once. A cycle of length k is a 


feel Of length k in which Nn, = m4 - A cycle is simple if no 


shes: 


node appears on it more than once before the start nod2 is 
repeated. 

Since a cycle of length kK will repeat itself every kK 
nodes, the output of the shift register will repeat itself 
every k bits. Thus, a cycle of length k can be described by 
the first k bits (or any k bits) of the shift register. If 
k < n, we repeat the k bits enough times to fill the n-long 
register. 

The 3-cycle 99 --> O81 "=> 19 <--> OH in Bo can be 
represented by (G91). The 3-cycle O1 --> 11 --> 19 -=> O1 in 
Bo can be represented by (G11). These cycles, (9081) and 
(911), appear in the graph B for all n > 2. However, theiE 
integer decimal representations change according as aa 
length of the register, changes. Since the cyclic Sn) stcuien 
WOl and 91l represent all of the possible ways to write a 
binary 3-tuple, (UJ%1) and (@11) are the only 2-cycles in oe 

A set of disjoint cycles that include every node In BL 
is called a factor of B.. In a factor, no nodes or ares are 
repeated. One such factor is defined by the mapping 
This mapping is called 


Pi(X),X50-+++X)) mise ee Sy eS 0 


no 1): 


the pure cycle mapping and the associated shift register is 
called the Pure Cycle Register (PCR). Golomb has shown that 
the pure cycle mapping decomposes the graph into 2Z(n) 


disjoint cycles where 


Z(n) = 1/n )9(a)29/¢ 
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moet. 2:DO.  lic—-121]. The summation is over all divisors d 


of n and 9(d) is Euler's phi-function. 


Edi t.4 7Pure Cycbese(n = 3). 


In a general graph, a path which goes through every node 
@aee ana only once is Known as a Hamiltonian vath. We can 
always extend a Hamiltonian path in a de Bruijn graph so 
that the last node in the path connects with the first node, 
meemg a cycle. This cycle is Known as aede Bruijn cycle. 
Beme> a de Bruijn cycle visits all the nodes in BL only 
Smee, tt iS a factor consisting of only a single cycle. it 
Can be shown that there are 2” such eycles in Boe where 
9(n-1) 


PeemeRet. G21. “Oneurstiche cyelomrin Be is 


(130010111). The cycle can also be written as the cycle 320 


Xx 


—-—-> 901 --> 910--> 1081 --> 911 --> 111 --> 116 --> 1909 --> 


690, witich can be seen in Fig. 1.5. 


PS 


An Eulerian path in a graph 1S 4 path whitch @itraveree. 


each arc exactly once. If the arc connecting the nodes 
. a 
iia 102 
012d 
oe 
ne _” 


Fig. 1.5 The de Bruijn Cycle (27919 mee 


) in 8. is labeled with 


CX), X5,00-0X " 


) and (X5+Xgeeee, 


n aa 


the (n+l)-tuple CX Xe eee eX ye Xn gy) then the arcs inj 


COLT ECSDONG EO. ene —mOoGces jin B Atl: Then two nodes in Baty are 
Connected by an are if their Gormrespondina anes 34 are 
such that the first arc enters a node in oe while the second 
arc exits that same node. Thus, an Eulerian path in BA 


defines 4 Hamiltonian path, in) se As the de Bruijn graph 


n+l ° 


1S connected and has 2 ares in and out of each node, there 


is always an Eulerian path in BY [Ref. 3], hence a 


Hamileconiam Dath® in 8 ; 
7 hast! 
Two isomorphisms exist in the de Bruijn graph in which 


the structure of the graph is preserved. They are the binary 


reverse and the binary complement mappings. Under wags 


reverse mapping, each node CX), Xoe00+2X ) is mapped to the 


ia 
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node ce, eer ky) As an example, the node 991 in B. is 
mapped to the node 19090. In the complementary mapping, all 
G's are changed to l'‘s and 1's to 9's. So, the node 991 is 
mapped to the node 119. 

For convenience, we will typically view the nodes in 3, 
in their decimal representation. Thus, the nodes will be 
numbered 3, 1, 2,..., 2% - 1. When more useful, we will 


recall the binary structure that fostered this graph. 


ms, . 
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Brg.) 1-6 de Bruljn Graph for n = 3,4 
(Decimal Representation). 
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Tl. "“OvVERTNGS 


A. DECOMPOSITION OF THE de BRUEIN "6k eeu 

A problem that has been considered for the de Bruijn 
graph BL is the maximum number of cycles into which the 
graph can be decomposed. Clearly, this number must be at 
least Z(n). In section I.C we saw that the pure "G@yaae 
mapping formed a factor in the de Bruijn graph consistingm@e- 
Z(n) cycles. A-long standing conjecture by Golomb was “Chew 
the maximum number of cycles into which 3 can be decompes=s 
1S. 2n) TURE. eo vows a 

In trying to prove Golomb's conjecture, Lempel macau 
Conjecture “of “hls Own 9~whateh implied Golomb's [Ref. 4}. 
Lempel's conjecture was that the minimum number of nodes, 
whieh 2 “semoved  £ron B that will result in an azyeiame 
Grape ts. 2 ne). 

It is clear that at least Z(n) nodes would be Ye yaw 
to solve Lempels conjecture. There would have to be one node 
lying on each of the 2(n) cycles from the pure cya 
mapping. The remaining problem was to find a set of Z(n) 
nodes for each n that, when removed from Boe would leave the 
graph acyclic. Mykkeltveit gave a constructive proohwyes 


Lempel’s conjecture by providing an algorithm to fing eae 


nodes [Ref. 5]. This in turn proved Golomb’s Cconjcctamen 
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Be REMA LINING PATHS 

Once the de Bruijn graph has been decomposed in the 
above manner, there 1S some interest in the structure of the 
remaining tree. In particular, knowledge about the lengths 
of the remaining paths is desirable. Long paths in the 
decomposed de Bruijn graph have cryptographic application. 
Because of their acyclic nature, these paths can provide a 
cryptographic key used for encoding. 

However, there are other applications of a hash coding 
flavor in which long paths in the decomposed graph are not 
desirable. For these applications we could even remove a few 
more nodes from the de Bruijn graph so that the remaining 
tree contains no unsuitably long paths. This process of 
removing more nodes can be continued, resulting in a forest 


whose maximum height tree is as short as desired. 


C. COVERING SETS 

Taking this deletion process to its ultimate extreme, we 
pose the question: "How many nodes must be deleted from B 
so that there are eae paths of length 9 remaining?”. In 
other words, we want to remove enough nodes so that no two 
adjacent nodes remain in the decomposed graph. Futher, the 
set of nodes remaining should be big enough so that no node 
can be added without creating a path. 

By deleting only those nodes necessary so that no path 


exists, the remaining nodes form a maximal independent set. 


nS 





(a) (b) 


Fig. II.1 Maximal Independeme Sets 
(a) (1,6) and@(5b)" 1273 B.. 


The set 1S independent in that if a node x is an element of 
the set, then neither the predecessors nor the successors of 
X are in the set. The set is maximal in the sense that no 
other node can be added to the set without creating a path. 

In Fig. II.la, we see that if all nodes except nodes 1 
and 6 are removed from Bs, we are left with a maximal 
independent set. Any other node in B, is either a vredecesor 
Or a successor of either 1 or 6 and is therefore excluded 
from the set. 

In Fig. II.lb, the nodes 2 and 3 form a4) maxiaga 
independent set in B,. Any other node except @ is adjacent 
to either 2 or 3 and cannot be in the set. The node @ cannes 


be added to the set either because, i1f it 1s left in the 


graph, a path to itself will be created. The node 2 
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which in B, is the node labeled 7, will also allow a path to 
itself. 

A subset S of the nodes of ey is defined to be a cover 
ot Bo if S 1s a maximal independent set and for every node x 
in B,- S, there exists a node y in S such that either an 
Basane< x5 y > aie an are <y,x> is in Bo: 

As mentioned before, special consideration needs to be 
given to the nodes @ and 2°-1. Because a path exists from @ 
to 9 and from 2%=1 to 2-1, neither of these nodes can be an 
element in a cover. Yet we need to ensure that these nodes 
are themselves covered. For the node 9 to be covered, either 


gn-l must be in the cover. Likewise, for the node 2 || 


imaor 
to be covered either ones or 2-2 must be in the cover. 


The set {1,6} forms a cover of B while the set {2,3} 


3° 
does not as it fails to cover the node J. 
Tt should be noted that this definition of a node cover 


is not the normal graph theoretic one in which nodes cover 


their incident arcs. 


i. LOWER BOUND 

When considering covering sets, the question OE 
cardinality arises. If the nodes in the cover are chosen 
carefully, how large a covering set can be obtained and, 
Similarly, how small a set can be obtained which still 
satisfies the requirements of being a cover? 

If a node x is in a covering set S, then the four nodes 


adjacent to x are considered to be covered by x. If we can 


ak 


TOE gape COV ¢ ane B. in which each node Gn Bl =o 7eeu7 
relative complement of S in Be is covered by only a single 
node in S, then we will have formed a covering set S GE 
minimum cardinality. In this case, each node in S covers a 
total of 5 nodes =- itself and the 4 nodes adjacent to it. 


Thus a lower bound for the number of nodes in a cover is 


Gaverminy ol. /5\e 


E. UPPER BOUND 

As noted in section I.C, a de Bruijn cycle Tome 
Hamiltonian path through the nodes of Bo. In choosing nod2s 
Lom ds Covers Lor Bi, we see that no two adjacent nodes in a 
de Bruijn cycle can be in the cover. Thus the nodes in a 
cover can be no closer on a de Bruijn cycle than every other 
node. This leads us to an upper bound of (2°)/2 for Seime 
maximum cardinality of a cover S. 

However, this upper bound on S cannot be achieved. When 
the de Bruijn cycle passes through the node 8 it must Go so 


) 


iy “tiveurerd or asia ae oo G6, lp...) wine wnodes 2(n-1 anda 
are separated by a node in the de Bruijn cycle yet only one 
of them can be in a cover because they are also adjacent in 
the graph. A similar situation exists with the nodes 
qin-1)_y) 27-1, and 2™-2. Thus, the upper bound can be 


reduced by a small amount to something less than half of the 


nodes of the graph. 


a2 


Ihe EXHAUSTION 

Ete we consider modes in the order of a de Bruijn cycle 
and form a set of nodes by choosing those nodes from the 
ordered set that do not create a path with any other 
selected node, the resulting set 1S a maximal independent 
set. Each node in sequence will either be covered or can be 
added to the set of previously chosen nodes. If the nodes 9 
and 2"-1 are also covered, the set is a cover. 

One way to reduce the work of checking that the set is a 
cover is to always start the de Bruijn cycle at the node l. 
ims will force the node 1 to be in the potential cover, 
ensuring that 0 is covered. No loss of generality results in 
choosing 1 to cover @© because, as noted in section II.C, 
either the node 1 or Zito must be in the cover. For any 
cover containing the node gn-l | the set consisting of the 
binary reverse of those nodes also forms a cover containing 
the node l. 

Sealer a,s and the de Bruijn cycle 1, 3, 6, 13, 10, 4, 
pee >, Lbiee7, 15, 14, 12, 8, ©, 1£ we follow the procedure 
described above, we obtain the maximal independent set 
meeec, 16, 9, 7}. Since 7 is in the set, the node 15 is 
covered and the set is therefore a cover. 

However, if we had not chosen 6 but chosen 13 instead, 
memwould have obtained the cover {1l, 13, 4, 5, 7, 12}, of 


greater cardinality than that of the previous cover. 


IA 


To examine characteristics of the different covers of 
the de Bruijn grapn, the following backtracking scheme can 
be used to find all covering sets of the graph for a given 


value of n. 


Algorithm: Exhaustion on Covers 

Sten =a Starting with node 1 on a de Bruga7 
cycle, form a maximal independent set by selecting 
the first node in cyclic order that does not create 
a path with any other selected nodes. When the 
process is repeated, if the node 92> .—) Wie] 
covered, this set 1S a cover. 

Step 2: If the cardinality of the set is 
greater than one, remove the last node brought into 
the set. If not, stop. 

Step 3: Consider the nodes remaining in the de 
Bruijn cycle, beyond the node just removed from the 
Independent set. Add nodes to the set, in order, 
that do not form a path to previously selVecved 
nodes. If all nodes in the graph are covered, this 
set 1S a cover. 


Step 4: Go to step 2. 


We consider again the example n = 4, starting with the 
de Bruijn cycle 1, 3, 6, 13, 16, 4, 93, 27535, > 
12, 8, @. In step 1, we would obtain the set {1, 6G, TO}ueee 


7}, which is a cover. Remove node 7 from the Set "asm 
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step 2. Then in step 3, we would consider the nodes 15, 14, 
12, 8 and 6 to be added to the set (1, 6, 19, 9}. The node 
14 can be added to form the set {1, 6, 10, 9, 14]. This set 
is a cover. 

Golng back to step 2, the node 14 is removed and the 
nodes 12, 8, and @ are considered. None of these nodes can 
be added to the set without creating a path, so we are left 
waeheethe independent set {1, 6, 18, 9}. This set is not a 
cover. 

The node 9 is then removed and the nodes 2, 5, ll, 7, 
15, 14, 12, 8 and © are considered to be added to the set 
{1, 6, 18}. This process continues until only the node 1 
remains in the set. With the Knowledge that the lower bound 
says that the cover must contain at least 4 elements for n = 
4, the process could have been stonped sooner. However, the 
extra checking required to see if the stopping criteria had 
been met 1S more expensive than allowing the program to run 
Zoecompletion. 

Miedetermmining the cost of the algorithm, it 1s noted 
that each node in the graph, except the four nodes adjacent 
to node 1, will each be chosen at some time to be the second 
node in the set. When the jth node in the de Bruijn cycle 1s 
chosen to be the second node in the set, the cost to fill in 
the remainder of the independent set is only slightly more 
Ewen the cost of performing the algorithm on Via st al 


nodes, where V = vile the total number of nodes in the graph. 


Since each node, starting with the third node in the de 
Bruijn cycle, is ultimately chosen to be the second node in 
the set, we can approximate the cost of exhaustively 
Searching a graph containing V nodes by the recursion 

f£(V) = £(V-2) + £(V=3) + 22. 459602) eee 
Here f(x) is the cost of performing the algorithm on x 
nodes. Note that the term f(V-1) is not present in the 
expression. This is because once the first node in the 
de Bruijn cycle has been placed in the set, the second node 
in the cycle is eliminated from consideration as it 1s 
adjacent to the first node in the cycle. The cost funcemae 
applies to every term in the sequence so, we could also 
write the expression 

(Val oye eV 3 a ar eal lee) 
etc.. We introduce the constants ay = 9 and a, = 1 and add 
the trivial term agf(V-1) to £(V ) ewe wen zes sic 

E(V) = a, f(v-1) + a, £(V=2) + Suh (Vos) 2 eee 

ere Ge 
If we substitute the equation for f(V-1) into the expression 


for f£lV)i= wevobtain 


ay) a,f(v-2) + (ag + a,)£(V-3) + (ag + ay) Gyee 
+ ... + (a, + ety) ) EN) + (ay, + ay) te 
BY Gilet ring a, = ag + a), we can write 
Fey) = shee), + eWpel(/ 5) | + a5t(V-4) + ... + ayf(2) 
+ aE) 


A similar substitution of £(V—-2) nto 72 (7 my 2etce 
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f(V) = a,f(V-3) * (a, toeeio 88 G2) as nagmet eas) V5) 


+ we. + (a, tama) = (2) (a, + a5)f£(1). 


Letting a, = a, + ay, we can then write 
£E{V) = ajf(V-3) + a,f(V-4) + a,f(V-5) + ... + anf(2) 
+ a,f(1). 


Continuing to make similar substitutions and defining 
additional constants, we obtain the equation 

ee ee yt vk) + a ft(V-k-1) + a, f(V-k-2) + ... 

zs a, £(2) 5 ete (al) 
after substituting for f(V-k+l). Here a, is defined to be. 
Qa.i9 + a,_,- Continuing with similar substitutions, 
ultimately yields the equation 

f(v) = a,_,£(2) + a,_5f£(1). 

In examining the sequence of ay 's, we see that the 
recursive definition and initial conditions are exactly 
those of the Fibonacci sequence. Thus, a closed form 
expression for ay is given by 
a, = [((1 + /5)/2)% - ((1 - V5)/2)8/N5. 


Therefore, we see that the cost of the exhaustive search 


algorithm is at least O(((1 + ¥5)/2)%) or 0(1.618%) i> 


complexity. 
Piosexiaustives Search was completed for n < 5. The 
results, seen in Table II.1, show that for n = 5, the cover- 


ing set of maximum cardinality is only 75% of the upper 


bound. Also for n = 5, the cover of minimum cardinality 


Zi 


found is larger than the lower bound. Thus, neither the 


upper nor the lower bounds are met for the graph Be. 


An exhaustive search of the graph for a given n will 
TABLE II.1 EXHAUSTIVE SEARCH 


Lower Minimum Maximum Upper 


n Pa: Bound Found Found Bound 
1 2 i 4) G) Ie 
2 4 i ie Ik 2 
3 8 2D 2 2 4 
4 16 4 4 6 8 
5 B2 i 8 i 16 


result in all possible covers of that graph. Because of 
computational constraints, a search was not done for the 
Graphs with n > 5. If a faster computer had been used, 
graphs of larger order could have been searched. However, 
due to the exponential growth of the number of nodes in the 
Graph, using a faster computer would have allowed the 
exhaustive search of graphs only a few orders larger. In the 
Sequel we develop algorithms cheaper than exhaustive search 
to give approximate solutions to both the largest and 


smallest cardinality covers of the graph Bn: 
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Pel. AATMAL COVERINGS 


A. ENERODUCT EON 

In this chapter, techniques are explored to find 
covering sets that are of maximum or nearly maximum 
cardinality. Methods to place nodes in the cover in a way 
that pack those nodes closely together yet does not violate 
the definition of a cover are examined. Also considered are 
Beas tO form a cover for BA based on independent sets from 


lower order graphs. 


B. FRUGAL 

In choosing nodes to form a covering set, .it seems 
reasonable to consider how many previously uncovered nodes 
will be covered when a new node is pDlaced in the set. If the 
meal 1S to Obtain a cover of maximum cardinality, a logical 
criterion for adding nodes to the set would be to choose the 
node adjacent to the fewest, as of yet, UNncOVemed nodes - 


This idea 1s implemented in the algorithm Frugal as follows. 


Algorithm: Frugal 
Step 1. Select node 1l and either node 2 = 2 or 


an-1 


node =e toOmpen im emer Set. (Ents 1s don] to 


ensure that the resulting set will be a cover.) 


Zo 


Step 2. From the remaining uncovered nodes, add 
the node with the fewest uncovered neighbors. In 
case of a tie, add the smallest such node. 

Step 3. If all nodes are covered the set 1s a 


cover, stop. If not, gqomtovsreeor 


In step 1, the number of combinations to ensure that the 
nodes 9 and 2" - 1 are covered has been reduced from 4 ways 
to 2 by forcing the node 1 to be in the cover. As noted in 
section II-F, this causes no loss in generality. 


Once node 1 has been selected to be in the cover, there 


rSsino choles whether te jnelude 2S ae g{n-1) 1 in gene 
cover for small values of n. For n = 1, no cover exists. For 
n= 2, 1s: Ce) cl ecla Somme Ave@ever by itseve. 
Forsn = 3,7: 23S 2 (3-1) 1. However, node 3 is a descendent 
of node 1 and cannot be included, so the node 6 = 2-7 Z 
must be added to the cover instead of node 3. 

For n = 4, the cardinality of the cover is one greater 
if the node oN) l1 = 7 is chosen instead of node 14. For 
n = 5, the cardinality of the cover issone GGpeatenaem: 2> ==2 


= 39 is chosen rather than node 15. Fer 6 < n < Il, elton 
choice of a node to cover 2” - 1 can be made with no 
difference in the cardinality of the resulting covers. 

Most of the time snent running the Frugal algorithm is 
spent in step 2, choosing the next node to be added to the 
set. By using an array to keep track of the number oF 


uncovered adjacent nodes of each node, this choice can De 
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made in O(V) ¢ime. Step 2 will have to be performed m times, 
where m is the cardinality of the covering set. Since V/5 < 
m < V/2, the cost of the frugal algorithm is O(v-). 

This considerable gain in cost over the exhaustive 
search algorithm makes it posible to obtain a covering set 
for much larger values of n. The results of the Frugal 
algorithm for n < 12 can be seen in Table III.1l. The upper 
bound has been adjusted to reflect the results of the 
exhaustive search done for n < 6. Note that the cover 
Produced by the Frugal algorithm meets the adjusted upper 
bound for n < 6. Also, Frugal seems to perform better for 
odd values of n than for even n. In section III.H we further 
compare algorithm performance, considering the cardinality 
of the set produced by the algorithm and the cost of running 


the algorithm. 


TABEE. TIT. d 


FRUGAL ALGORITHM 


Upper % Upper 

n Bound HS ee [eul Bound 

x O G 100.09 

2 } 1 100.00 

5 2 2 1095 .06G 

4 6 6 1900.00 

2) 2 i 109.06 

6 32 ZS 7S 3 

i 04 ae. 82.31 

8 L283 88 SG. 75 

e) ZN 217 84.77 

2 a2 Bhs, 74.62 
li 19024 oO Si» 
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Ce. SEQUENTIAL EE 

In section TI-F, an initial independent set was found by 
choosing nodes along the order of a de Bruijn cycle. Nodes 
were placed in an independent set if they created no path 
with any other node previously placed in the set. The fact 
that a de Bruijn cycle is a Hamiltonian path in BA allowed 
an upper bound to be placed on the cardinality of a cover. 
However, the algorithm made no special use of the properties 
of the de Bruijn cycle except that two consecutive nodes ¥en 
the de Bruijn cycle cannot both be chosen for the 
independent set. This cuts down the time required for the 
search. In fact, the sequence of nodes could be considered 
in any order as long as each of the nodes from 1 to 2 —_ae 
1s considered at some point for placement into 
independent set. 

Because of the ease of coding, the sequence of nodes 
that starts with 1 and proceeds sequentially to 20 = 2o0 
examined with some interesting results. A significant 
difference in the percentage of nodes that forms an 


independent set occurs depending on whether n is even or 


odd. For this reason the two cases are considered 
separately. 
ise Aeoven 


We illustrate the general method by considering the 
case for n = 4. An independent set of nodes in the graph for 


n = 4 is formed by sequentially including nodes in the set 


when no conflict exists. By selecting node 1 for the set, 
nodes 2 and 3 are covered because they have node 1 as a 
predecessor. Nodes 9 and 8 are also covered because they 
have node l as a successor. 

Proceeding sequentially to the next available node, 
4 is chosen. Node 4 covers its successors 8 and 9 and its 
predecessors 2 and 190. Likewise nodes 5, 6 and 7 are 
included in the set, covering their successor nodes 106 
through 15. Their predecessors fall in the blocks 2 through 
3 and 19 through 11, causing no conflict. As no other nodes 
can be added, sequentially considering the nodes ylelds the 
fee t,4,5,6,7}, which is a cover. 

For n = 6, the nodes 1, 4, 5, © and 7 again cover 
the nodes 9 through 15 in essentially the same way that they 
did for n = 4. Nodes 16 through 31 can then be included in 
the set to cover nodes 32 through 63. The predecessors of 
nodes 16 through 31 lie in the block from 8 to 15 and in the 
block from 46 to 47. Therefore, including nodes 16 through 
31 creates no path with nodes already in the set. All the 
nodes are covered and node 63 is covered by 31, so the set 
1S a cover. 

Proceeding sequentially, the nodes chosen for a 
cover for n = k can also be chosen to cover the first 2" 
nodes in the graph for n = k + 2. Nodes 2k ielqne(ehe e/g’ 2 eee Ei 


can be placed in the set to cover their successors, nodes 


ak+1 ak+2 


Enneugn Io im the “graph for n = k + 2, the 


eee 


2(k+l) _ 1) Sas Same 


ee: 2k - 1 and from 


predecessors of the block (2k. 


to the set, lie in the blocks from gk-1 


k isagd 


K+1 EO 2 <a , causing no conflicts. ae 


2k-l 4 9 


7 a 1 are in the independent set verifying 


nodes 1 and ( 
that it is a cover. 

The cardinality of the cover formed in this way for 
the graph n = k + 2 is 2* greater than that for the cover aon 
n = k. Thus, we have the recursion 

f(n+2) = f(n) + 2” 
for n even, where f(n) is the cardinality of the cover for 
B.. By repeated substitutions, this recurrence relation, 
along with the initial condition £(2) = 1, yields the cloges 
fOrm Solution 

f(n) = 2°" = ees 
Thus, for n even, we have a cover with cardinality of 
approximately one third the number of nodes in the graph. 
This is far from the upper bound of half of the nodes in the 
gzvaph. 

The cover for n = k + 2 can also be produced tram 
the cover for n = k in a recursive manner not requiring a 
sequential search. 

For n = 4, we found the cover S$ = {1, 4, 5, 6G, 7am 
We define the second generation descendents of node 1, in 
Bos to be the nodes 4, 5, 6 and 7. The second generation 
descendents of node 4 in B, are the nodes 16, 17, 18 and 19. 


6 


Continuing in this way, we see that the second generation 
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descendents of all the nodes in the cover formed by the 
Sequential fill algorithm for n = 4, form the set [4 - 7, 
ie —- 31} in Be. Adding node 1 to this set yields the 
covering set formed by the sequential fill algorithm for the 
case n = 6 previously. 

In general, the covering nodes in the sequential 
fill procedure for n = k + 2 are the second generation 
descendents, in B45, of the covering nodes in the 
sequential fill procedure for n=k, together with the node l. 


This leads to the recursion 


e(n 


Selving this recursion, with the initial condition £(2) = 1, 
again yields the closed form solution 


f(n) = (2% - 1)/3 


After seeing the pattern that developed in forming 
these covers, a very inexpensive algorithm can be developed 


to generate them with only a small amount of storage. 


pide rtthme sequential Fill; n even 


Step 1. Place the node l ina queue. 
Step 2. Remove the first node from the front of 
mine queuc, Call it x, and add it to the cover. 
; (n=2) } 
Step 3 If x is equal to 2 , remove the 
remaining nodes from the queue, add them to the 


cover and stop. If not, go on to step 4. 


Sys) 


Step 4. Find the second generation descendents 
Of aL Bae i.e. 4x, 4x + 1, 4x + 2 and 4x peor ae 
these to the back of the queue in ascending order. 


Step 5. GO°-to stepe. 


The number of multiplications done in step 4 is 
approximately one fourth the number of nodes in the cover, 
Or one twelfth the number of nodes in the graph. Thus, we 
have a very fast, O(V) algorithm. Unfortunately, as noted 
before, the cardinality of this cover is not very /eclosemmam 
the. Upper’. bound “or Vithe solutions found by previrame 
aleoririnis. 

Ze ate 

For odd values of n, the concept of forming 
independent set by sequentially considering nodes of the 
de Bruijn graph works in the same way as it does for n even. 
Starting with node 1, a node is includ2d in the set if it 
does not creats a path with another node in the set. 

For n = 3, node l is placed in the set, covering. 
successors 2 and 3 and its predecessors J and 4. Proceeding 
Sequentially, 5 is the next free node. Node 5 covers its 
successors 2 and 3 and its predecessors 2 and 6. The next 
uncovered noce is 7 but it cannot be included in the jse 


for it allows a path to i1tscltloetnewero--, the set 


{ 


S = {1, 5} is a maximal independent set but not a cover. 


Loe 


However, 1f node 5 is deleted from S$ and node 6 replaces 5, 
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the set S = {1, 6} would still be an independent set and all 
nodes in the graph would be covered. No nodes become 
uncovered as a result of exchanging 6 for 5 for the 
following reason. With the exception of node 6, all of the 
nodes adjacent to 5 are smaller than 5. Thus, these nodes 
were considered for inclusion in the independent set prior 
to node 5, but were rejected. Therefore, they must be 
covered by nodes in the set other than 5. 

In Bo, we begin by placing node 1 in the set. Nodes 
2 and 3 are covered by node 1, so we skip them and go to 
node 4. Nodes 4, 5, 6 and 7 are included in the set, 
covering their successors, nodes 8 through 15. Node 16 is 
covered by node 1, so we consider node 17. Node 17's 
Successors are nodes 2 and 3 and its predecessors are nodes 


8 and 24. None of these is in the set, so 17 1s added. Nodes 


18 and 19 are predecessors of nodes 4 - 7, so they are 
excluded from the set. The successors of nodes 20 =- 23 are 
the nodes 8 - 15. Their predecessors are the nodes 19 and 11 


and the nodes 26 and 27. None of these nodes is in the set, 
so the nodes 20 - 23 are added to the set. Node 24 is a 
predecessor of 17, so it is not included. Node 25's 
successors are 18 and 19 and its predecessors are 12 and 28. 
Since these nodes are not in the set, node 25 is added. 
Nodes 260 and 27 are excluded because they have successors 
23 - 23. Node 28 is excluded because it is a predecessor of 


25. Node 29 can be added because neither its successors, 26 


Of 


and 27, nor its predecessors, 14 and 39, are in the set. 
Finally, node 39 is excluded because it has just been 
covered by node 29 and node 31 1s excluded because it makes 
a path with itself. This yields the maximal independent set 
S = {1, 4-7, 17, 29-23, 25, 29}. Neither node 15 nor nodemem 
is in the set so S is not a cover. However, again no 
conflict arises if node 29 1s replaced by node 30. )iijume 
resulting set is still independent and covers Be for reasons 


Similar to that discussed concerning the cover for B.. 


Oo, Il, 2, 3, 4, 5, %6, 7, 8,9,10,11,12 )0 ee 


Figure ier. | 
Nodes in Be with Sequential Fill Set Underlined 


In Figure If1.1, 9 the “neodecmerna Be are written jm 
order from 8 to 31. The nodes in the independent set formed 
by the sequential f111 algorithm have been underlines 
Starting with node 9, we observe that one node, 6, was 


excluded from the set, then one node, 1, was includsaa 


n=xt two nodes, 2 and 3, were excluded, then four nodes, 
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4, 5, 6 and 7, were included and the next eight nodes, 8 - 
1S 3, were excluded. Now the pattern begins anew with one 
node, 16, excluded, one node, 17, included, two nodes, 18 
and 19, excluded, then four nodes, 280 - 23, included. 
Starting over again, we exclude one node, 24, include one 
node, 25, then exclude two nodes, 26 and 27. This pattern of 
partioning the nodes continues, each time stopping when the 
Size of the last segment is half the size of the previous 


last segment. For n= 5, the partition sizes are l, l, 2, 


We see that the first sixteen nodes in Be ace 
included in the first pass. The next eight nodes are 
included in the second pass. Four more nodes. are 
partitioned on the third pass. Two nodes are partitioned on 
the fourth pass. One node is partitioned on the fifth pass 
and one more on the sixth pass. 

In general, the pattern observed in Be Oeceurs in BO 


gn-l _ 4 are 


n-2 


for each odd value of n. The nodes @ through 
ae ; 6) 4) ik 2 
Pape itloned im segments of size 2°, 2°, 2, 2°, «.«- 2 
The first node in the pattern, 8, and all nodes in Segments 
that are an odd power of 2 in size are excluded from the 
independent set. The nodes in segments that are an even 
power of 2 in size are included in the independent set. This 
partitions the first half of the nodes. Then the nodes gn-l 
a | ee ag a. 
mmrough 2 jee - l are partitioned in segments of s1ze 


; 2°, .«. 2"72, the node 2771 and all nodes in 
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segments that are an odd power of 2 in size are again 
excluded from the independent set. The nodes in segments 
that are an even power of 2 in Size are again included in 
the independent set. This partitions the next fourth of the 
nodes. This process 1s repeated with one fewer segment in 
each application until all of the nodes in the graph have 
been partitioned. Proceeding in this way, one half of the 
nodes are partitioned on the first pass, one fourth on the 
second pass, one eighth one the third pass, etc.. 

For the nodes ¥Y through gn-t - l, each segment of_k 
nodes that is added to the independent set has a 2k long 
Segment of successor nodes subsequent to it which 1s 
excluded. Thus, each segment is twice as long as the 
previous one and only every other segment 1s allowed in the 
set. This pattern 1s broken by the node 20a which is @ 
predecessor of node one, but the process repeats. 

Each segment of k > 2 nodes in the first half of the 
graph also has a corresponding k/2 long segment of 
predecessor nodes in the next fourth of the graph. These 
nodes, from 277! tq 2M-l 4 29-2 _ 1, are either excluded 
from the set or included in the set depending on whether 
their successor segments in the first half of the graph are 
included or excluded respectively. The nodes that are added 
to the independent set between 2n-1 ang 2271 + 297-4 - 1 nave 
predecessors in the first half of the graph and in the next 


elghth of the graph. The predecessors in the first half all 
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Seal | MW seqnenes that have been excluded from the set. Thus, 
the pattern of every other segment being added to the set 1s 
repeated with each segment being twice the size of the 
previous one. The pattern is broken by the node gn-l , gn-< 
which is a predecessor of node gn-l + l. 

In a similar manner, each segment of K/ 2 2 nodes 
ErOm gn-l ete) gn-41 + 2n-e - l has a corresponding k/4 long 
segment of predecessor nodes in the next eignth of the 
graph. The inclusion of these segments is also determined by 
the status of their adjacent segments in th2 previous 


fourth. The nodes that are added to the independent set 


between 2971 4 gn-2 and ‘gn-l + gn-2 + gn-3 - l have 


predecessors in the first half of the graph ari in the next 
sixteenth of the graph. The predecessors in the first half 
also fall in segments that have been excluded From the set. 


We continue in this manner until all of the nodes in 


the graph are covered except 2" - 1. However, the node 
2" ~ 3 has been included in the set and can be replaced by 
the node 2" - 2. This causes no conflicts and results in all 


the nodes in B_ being covered. 
To find the cardinality of the resulting cover we 
Simply add up the number of nodes in the segments that were 
included in the independent set. For n = 5, we have 
Is] =1+4+1+4+1 +1212 
The first pass through the partitioning pattern 


places 1+ 4 = 5 nodes in the independent set. The second 
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pass also places 1 + 4 = 5 nodes in the independent set even 
though there was one less segment cosidered than in the 
first pass. Looking again to figure III-.l, we see that the 
last segment in the first pass was not included in the 
independent set. So, even though the second pass partitioned 
only half as many nodes as the first pass, the number of 
nodes added to the independent set was the same. It will be 
the case that every even numbered pass will place as many 
nodes in the set as the previous odd numbered pass. 

In general, for any ‘odd value of n, the cardinality 


of sequential fill set 1s given by 


Gi (29 Dee. eee gn-3) + (2% + 2° + Oe gn-3) 
rs (29 ed ere: A. a 2n->) + (22 pop ae 2n-5) 
+ (2° + 2? 5 a 2n-7) + (2° + 2° + + gn-15 
+ 
+ (2% + 22 ) + (2° r 2°) 
a + 2° 
Letting k = (n - 1)/2 to facilitate indexing, we have 
oe 54 k-2 > 9) >: 
IS] = 2 >) 2*) + 2 9) 20) ee ee 
j=6 j=0 j=0 
or 
ie kK-1 ae 
[Sip= 2a Pac 
EL 4 
Sa 2{ earn) 
Since x Doe 2 : ~ 1)/3 , the cardinality "oes 
ya 


1S given by 
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K | 
Bilwcsmee eos TD) _ a ai3 
1=1 


This summation ylelds 
eae 1) = 6k1/> . 


Finally, substituting k = (n = 1)/2, we obtain 
ieee? = = Bn -5]/9 


As n grows, the fraction of the number of nodes in 
the independent set goes to 4/9. This is quite close to the 
Moper bound of 1/2. 

For the case of even values of n, i1t was observed 
that the second generation descendents of the nodes in the 
independent set for n = k appeared in the set for n = k + 2. 
This allowed us to determine the een ce of the 
sequential fill sets recursively. The same methods can be 
applied to odd values of n to yield the sequential cover 
without going through the sequential search of the nodes of 
Bae 

EOumiu—  S,eeene nodes. 4, 5.6, /;, 23, 21, 22, and 23 
are the second generation Jaseeaclanee of nodes 1 and 5 (the 
independent set for n = 3 formed by the sequential fill 
algorithm). These nodes, together with the nodes 1, 17, 25 
and 29 form the sequential f1i11 set for n = 5. These 
additional nodes are the nodes of the form 1, (1 + gn-ly 

n-l1 , 
OMe oly omm4) i, me )} for odd values of n 
i= 


greater than 1. For each odd value of n, we add a total of 
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n- 1 nodes to the set of second generation descendents of 
the sequential fill .cover for the previous odd n. Letting 
f(n) be the cardinality of the set formed by the sequential 
fll “algorLttnm stor ae nm an Odd integer, we have the 
recursion 

f(n) = 4£(n-2) +n - 1 
with the initial condition f£(3) = 2. Solving this system by 
substitution, we get the same closed form expression 
obtained earlier 

£(n) = (22°? .63n = Shoe 

The following algorithm takes advantage of ths 

Structure of the sequential fill sets for odd values of n by 


extending the independent set for Bi-2 to form the set for 


BAe In doing so, we must start by forming the set for n =e 
S = {1, 5}. From that set, we form the set for n = 5, s@hen 
for n = 7, and so on till we reach the d2sired value Game 


When the last node added to the independent set is 
incremented by one, the set will be a cover. The set formed 
in this manner 1S th2 same set that is obtained by 
sequentially considering Tees in the graph and includiie 
them in an independent set if no path is created. However, 
checking each node to see 1f one of its four neighboring 


nodes is in the set is much more time consuming. 


Algorithm: Sequential hig mec 


Step 1. Initializeme tosbesoe 


Step 2. Place node 1 in the set. 
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Step 3. Complete the set by adding the nodes 


k-l . 
ea 2 meee 25 eo yee, (1 + +b) 27). 
Jae 
Step 4. If k = n, increment the last node 


placed in the set by one and stop. If not, go on to 
step 5. 

Step 5. Increment k by 2. 

Step 6. For each node x in the independent set, 
replace ay the nodes 4x, 4x + 1, 4x + 2 and 4x + 3. 


Step 7. Go to step 2. 


The multiplication aone in step 6© is the dominant 
MmeretOr in the cost analysis of this algorithm. Since one 
multiplication 1s required for each node in th2 previous 
set, this requires O(V) time each pass through the loop. To 
obtain the cover for B., approximately n/2 passes must be 
made. Therefore, the algorithm is O(nV) or O(VlogV) in cost. 
This gives us a fairly fast method of obtaining a covering 
meee tnat 15 Close to the upper bound in cardinality, for odd 
values of n. 

Since multiplying a number by 4 is just a left shift 
of its binary representation by 2 places, if the programming 
language used permits the manipulation of the actual storage 
bits this operation is rather trivial and the algorithm 


becomes even faster. 
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D. 3-COLORING 

The chromatic number of a graph is the smallest number 
of colors that are required to color the nodes in the graph 
so that no two adjacent nodes are of the same color. The de 


Beulah gram 8 for n greater than 1, contains two Cyelee 


n? 
of length 3. Therefore, the chromatic number of BO is aw 


least 3. 


Theorem III.1 

The senLOomeat re mumbes ie BA 1S. .3. 
PLOOr- 

Due to the existance of 3-cycles, we see that 
the chromatic number of BL is at least 3. Conside: 
the nodes of the graph labeled with their binary 
representation. We can color the nodes with three 
COLOrEs by coloring the nodes whose binary 
representation ends with an odd number of ones with 
the color A, coloring the nodes ending with an odd 
number of zeros with the color B and coloringwa 
ether nodes with the colerre.- 

The nodes colored A have a successor colored B 
and a successor colored ¢C. Nodes colored B have a 
successor colored A and a successor colored C. Nodes 
colored C have a successor colored A and a successor 
colored B. (The nodes @ and 2” = 1 are adjacent ee 


themselves but this causes no problem). 
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Let us examine the set of nodes colored A. The nodes 
that end with a single 1 have a 939 in the next to the last 


bit. Since each of the remaining n - 2 bits can be either a 


G6 or a 1, there are gn-2 nodes in BL whose binary 


representations end with a single 1. Likewise, there are 


22-4 nodes ending with exactly three 1's. In general, there 


are gn-2k 


nodes whose binary representation ends with 
Eeectly (2k = 1) L's. To find the cardinality of the set of 
nodes colored A, we simply add up the number of nodes ending 


With an odd number of l's. 


For even values of n, summing qn-2k Ou eK = 1; A awl 
yields 
lAl = MO yn-2k 
k=l] 
= (2% - 1)/(4 - 1) 
See =) 1 1/35 
For odd values of n, the summation ends at k = (n-1)/2. 


This accounts for all the A colored nodes ending with 
(nm - 2) 1's or less. However, since n is odd, the all 1's 
nm-tuple is also colored A. Thus, 


(n=1)/2 
LA] gn-2k 


1 + 
k=1 


hea 2° =) (4 =D) 


(2e ee any /e3 
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The nodes that are colored B are the binary complements 
of the nodes colored A. Thus, IBIl = |JAl. We can find the 
cardinality of the nodes colored C by subtracting twice the 


cardinality of the nodes colored A from 2". For even values 


of n, we have IAI = IBI = (2? - 1)/3 and Ic] = (2" + 2)/3. 
For odd values of n, we have |Al = {Bi = (2" + gue 
Ic] = (2" = 2)/3. Thus, this coloring partitions =n= meee 


of B,| into 3 sets, each having approximately a third of the 
nodes of the graph. 

Since each node is adjacent only to nodes of the other 2 
colors, any set of like colored nodes is adjacent to all the 
nodes in the graph. If this set is independent, it will be a 
cover. A given colored set will fail to be independent if it 
contains either the node 39 or the node 20 ~ 1. For even 
Ordered graphs, the set colored A forms a cover as does the 
set colored B. For odd ordered graphs, th2 set colored 
forms a cover. However, the cardinalities of these covers 
are not very close to the upper bound, nor are they as large? 
as the solutions found by previous methods. 

We count the number of nodes of a given color by a 
recurSive process. 

The binary representation of each non-zero node in BL 
ends in either k 8's, where k <n, Or k 1 S, where Kise 
Each node is colored according to this @nding. Whenua. 


nodes in BL are viewed as nodes in Ee the ending of their 


+1! 


binary representation 1s unchanged. Thus, the nodes are 
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colored the same color as they were in Bo: For instance, 
the binary representation of node 7 in By is lll. Because it 
ends in an odd number of l's, it is colored A. The binary 
representation of node 7 in B, is 8111. It still ends in an 
odd number of l1's, so node 7 is colored A in By. iis, Once 
a non-zero node is colored, it will remain that color in all 
higher order graphs. (The node J will alternate between the 
color B and the color C, depending on whether n 1s odd or 
even). If we can determine the number of nodes that are 
added to a colored set as result of increasing n, the 
cardinality of that set can be found recursively. 

We consider the recurrence relationship only for the 
m@ereginality of the set A. As was seen earlier, once this has 
been determined, the cardinalities of the sets B and C can 
be obtained easily. We again consider the cases for n a4ven 
and n odd separately. 

First, we consider the case where n is @ven. Recall that 
the nodes colored A are those whose binary representation 
ends with an odd number of 1's. Because n 1s even, we know 
mmat each node in B, colored A has a 8 somewhere in its 
binary representation. One of these O's ends the final 
string of l's. Then, for each of these A colored nodes in BO 
tO be viewed as a node in Poe ee ences tO place elther 
a 9 or al in each of the two additional bits in the binary 
representation in B+2° Thus, for each A colored node in B 


there are four A colored nodes in Bat? with the same last n 
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bits. In addition to these, the node that Nas ao.) im seme 
highest order bit, followed by (ntl) 1's, will also be 


colored A. If we let f(n) be the number of nodes Coloreqm 


in Bae we have the recurrence relation 
f(nt+2) = 4£(n)}> ee 
for n even, with £(2) = 1. This same recursion was some 


previously to yield 
€ (yn) See ee 
For odd values of n, the situation 1s similar to theca 
even n in that for each A colored node in aie there avreme 


nod2s in 3 with the same last n bits also colecreates 


However, we need to pay special attention to the all 1 node 


in Bo | ROY this node, we ar2 not free to choose any 


combination of O's and Il1‘s in the two additional bits ae 

Bt2° Since n is odd, the node that starts with a Ul ane 

then a string of n 1's is a color C nods. SO, we  omaee 

Subtract it from our count. This gives us tne recurocmen 
f£(n+2)/= 46 a) =o 

for n odd, with the initial condition £(3) = 3. ThE So tiie 


to this system ts 


fin) = (2 4a ee 


jae DOUBLING 
In section TII.C, we observed that an independent set in 
B42 could be formed by mapping each node in the sequential 


FL ISGCOverimaog rset, ©. B to its four, second ge2neratwen 


5U 


descendents in B This independent set can be made into a 


mez 
cover by adding selected nodes to it. Thus, the cardinality 
of the resultant cover in B +2 is more than four times the 
cardinality of the initial cover in Boe This result leads us 
to search for similar relationships between covers in BO EV oul 
in Boel 


Two methods are discussed in this section which map a 


cover in B_ into an independent set in B Geonaitions 


n n+l ° 


under which this set can then be made into a cover are 
presented. For each method, the cardinality of the cover in 


B Will be at least twice the cardinality of the cover in 


ile : 
DBUGeeSssSOrs in are | 
The first method 1S very sSimllar to the pattern 
observed in the sequential fill method of forming a cover. 


Seven A cover S in 34¢ an independent set S’ can be formed 


in B,; by mapping the nodes in S to their successor nodes 
Her Ce That is, each node x, in S, will be mapgoed to the 
meaes 2x anaw2x + 1 Pn 341° Note that there is no modulus 


reduction necessary as the n-tuples become (ntl)-tunles by 
morse doubling. Thus, the cardinality of S’' is twite@e that of 
S. If the nodes 9 and gn+l - l are covered by nodes in S', 
then by adding any uncovered nodes, S' can be made into a 
Bovern, for ee 


For example, the nodes of the covering set S = 


4) in B,, are mapped to their successors in 8, forming 


oak 


s' = {6, 7, 8, 9}. The set S”" is an independenuma mea Ba. 
By adding either node 5 or 16 to S*, it is made ityeeeee 
cover. 

The following theorem proves that the set lin Pa 
formed by the successors of an independent set in ce is 


always an independent set. 


Theorem III.2 


If the set S = Soar Xp cee 8 xy J LS. 4m 
independent set in B,, then the set S' = (2x1, 
2x) PO 2x pac 2k ey 2X1, 2x, + 1} is) ae 


independent set in B atl: 
PrOOr: 


Assume not. If S' is not an independent set in 


Dee then there exist nodes x, and x in S susan 


that one of the following six cases 1s true. 


Case l. The node 2X 5 is Commected to@itse liiwun 
B 
n+l ° 
For this to be true, the node 2X must be a 


descendent of itself. Thus one of the following 


supcases must hold. 


+ 
a) 2x. = 4x. (mod 277) 
1 i 
TALS 1s true if, ‘and, only ai Ga 9. However, 
Since S is assumed to be independent, xi = 8 cannot 
be in S. 
b) 2x, = 4x, + 1 (mod 2 


a2 


This subcase is eliminated because 2X5 (mod 


ntl) 


gntly 1S moda. 


1s even, while 4x. + 1 (mod 2 
Thus, the two cannot be equal. 


Therefore, Case 1 cannot hold. 


foe 25 tne aede 2x, + 1 is connected to itself 
at Batl’ 
By an argument similar to that of Case 1, »this 


implies that x; = 2% - 1, which cannot appear in an 


independent set in Bae 


Case 3. 2x, PeecOumecred, tO 2x mail. 
This implies either that x, = 2% ~ 1 or that 


Xs =- 0. Nertner condition can hold if S 1s an 


independent set in BL: 


ase 4. 2x. j r ei 2 
Cas xX, is connected to ex, in Biiy 


By reducing Case 4 to its subcases, we see that 


a must be connected to x; in Be fer this®tor nord. 


However, this cannot occur in an independent set. 


Case 5- 2x, is connected to 2x. + 1. 
Again, this implies that a path exists in o 


between the nodes Xs and X 3 


Case 6. 2x5 + 1 is connected to 2x. + l. 
As before, a path must exist in B, for this to 
be true. 


On 2D. 


3 


Since, by definition, a cover is an independent set, 
Theorem III.2 holds for all covers. By mapping the nodes in 


a cover S in BL to their successors in 8 we form an 


n+l’ 
independent set S'‘' whose cardinality is twice that of the 


cardinality of S. If the end nodes, @ and 
covered, S‘* can be mad2 into a covering set by adding any 
uncovered nodes of B_,, to it. As discussed in section II.C, 
the nodes 9 and 2"+! ~ 1 cannot be included in the ceovenumag 
re because of the path they make with thems@ives. 
Therefore, they must be covered by neighboring nodes that 


are in the cover. Thus, whether an independent set S' can be 


made into a cover or not depends on whether the nodes 9 and 


Dee le ie l1 each have a neighboring node that can be included 
in-S. This isnot. always the scase- 

Starting with the cover S = {176}, in B,, we obtain 
the independent set S' = (2, 3, 12, 13}, in Bye Node 1 


cannot be added to S‘' because nodes 2 and 3 are present. 
Node 8 cannot be added because it creates a path with node2 
12. This leaves us with no way to cover node 9. Thus, this 
set cannot be made into a covering set. 

As a second example, we begin with the sequential 
FAI Cover: Sor in (1, 4, 5, 6, 7}. By takingueeae 
Successors of S in B., we form the independent set S' = 
(2, 3, 8, 9, 18, 11, 12, 13, 14, 15})% Once aigai nye eee 


does not allow the node 38 to be covered. Node 1 cannot be 
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added to S* because it is adjacent to nodes 2 and 3. Node 16. 
cannot be added because it is adjacent to node 8. Therefore, 
S' cannot be made into a cover. 

The following theorem gives certain sufficient 
conditions for which the end nodes are guaranteed to be 


ontl 


covered by S‘’. With the nodes 9 and 1 covered, S' can 


be made into a covering set by adding any uncovered nodes to 


Pt. 
Theorem III. 3 
If the nodes 2971 ana 297-1 - 1 are in a cover § 
in Boe then an independent set S* formed in ena 
from the successors of the nodes in S, will cover 
the nodes 8 and 2™tl - j, 
EGOOL: 
By Theorem III.2, S‘' is an independent set. 
Since the nodes gn-l and gn=l = } are in S, their 
respective successors, nodes 2a se lewd, be 
in S'. Thus, node 9 will be covered by 2” and node 
antl _ 1 will be covered by 2% - 1. 
OnE. D. 
Another sufficient condition is given by Theorem 
ient. 4. 


Theorem III.4 
If the nodes 1 and 2" = 2 are in a cover S in 


Bee then an independent set S'‘' in Bat] can be formed 


De) 


n+t 1 wil be” covered. 


such that the nodes 0 and 2 
Proof: 

If the nodes 1 and 2" - 2 are in S, S' can be 
formed in the following way. Map the nodes in S to 


their binary reverses in BA and call that set R. R 


will also be a Cover Tin B,- The mapping will take 


node 1 to node 2.7! ana node 2% = 2 to node 
De Sey ay Mone by Theorem III.3, we see Vtnatusam 
taking the successors of R in B atl! we form an 
independent set in which the nodes 9 and pe - l 


are covered. 


Op Eon 


Thus, by Theorems III.2, II1I.3 and tT1iv4, fia. 


Opposite corners of the de Bruijn graph (nodes 1 and 20 =e 
n-l n-1l , 

Or nodes 2 and 2 - 1) are in a cover for Boe a cover 

fOr B+] can be formed that is at least twice as larg? in 


cardinality as the cover for Bo: 
2.~— €Ompanion: Pains 
Consider an independent set S in B.. By €hs 
following lemma, S is also an independent set in Boat: 
Lemma III.5 
If S 1S an independent set in Ba then S 1s an 


e 


independent set in aa 


ProGeE: 
Assume not. If S is not independent in Bel 


then there exist nodes x and y in S such that x 1s a 
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successor of y in BA This can be true in one of 


lls 


two cases. 


Case l. x = 2y (mod gntl) 
; = n+l 
An equivalent statement 1s x = 2y + k2 Or 
that x = 2y + (2k)2", where k is an integer. 
However, this implies that x is a successor of y in 
ae, which contradicts the premise that 5S is 


independent in Bae 


Case 2: x = 2y + 1 (mod ALS 


By a Similar argument, we see that for this case 
to be true, we must have x = 2y + 1 (mod 2%), which 


again is a contradiction. 


In section I1.B, th2 companion of 4 node x was 
defined as the node whose binary representation agreed with 
that of x in all but the highest order bit. Since th2 nodes 
in S are in the graph 3_, they can be repressented by binary 
n-tuples. When viewed as nodes in Boal their binary (n+tl)- 
tuple representation has a O in the highest order bit. Thus, 
the companions of the nodes in S, viewed as nodes in Boe: 
will have al in the highest order bit. In other words, the 
companion of a node x, in S, 1s the node x + Dee 

Given an independent set S, we form the set S'‘' from 


the nodes in S and their companion nodes in 8B By Theorem 


n+l° 


IIIl.6, we see that S' 18S an independent set in B +1: 


a7, 


Given an independent set S, we form the set S'‘ from 


the nodes in S and their companion nodes in B By Theorem 


n+1° 
III.6, we see that S‘ is an independent set in Be 

Theorem III.6 

If S = (x,, Xz see x} is an ind2pendent set in 
Boe then the = set. o gee LXpreees Xe Yyrcess ¥,}. 

= n ) ) a j 

where cet esd + 2°, 18 an independent set in Bae 
PEOOE: 


The proof 1s essentially equivalent to the proof 


Of Theorem 1ii.2Z: 


As an example, consider the cover S = {l, 6}, in B.. 
S = i1, 6} is also an independent §seuern B,.- We add the 
nodes 9 and 14 (the companions in By of 1 andue 
respectively) to S to form the independent set S' = [{1, 6G, 


9, 14}. If we add either node 5 or node 108 to S‘', it becomesd 


Since Theorem III.6 holds for any independent set, 
we have a second method of forming an independent set S'‘, in 
oe ae ErOm an .COver, oS, in 83,°: Again the cardinality of S’ 1s 
twice the cardinality of S. If the nedes™oeand pu - l are 
covered, then S‘' can be made into a covering set by adding 
any uncovered nodes. Again, conditions exist for which the 
end nodes cannot be covered. 

The set S = {1, 4, 5, 7, 12, 13), is a Cover ieee 


4 
By adding nodes 17, 20, 21, 23, 28 and 29 (the companion 
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nodes in B, of the nodes in S), we form the independent set 
on eee, 7,082, 1S, lie 23,2) 23, 28, 29}, in B 
However, node 31 cannot be covered by adding nodes to 5 
Node 15 conflicts with nodes 7 and 23 while node 34 
conflicts with nodes 28 and 29. Thus, the set S‘ cannot be 
made into a cover in B. by simply adding nodes. 

Theorem III.7 gives a sufficient condition so that 


gntl 


the nodes 9 and l1 are covered by nodes in S'°‘. [In that 


case, S'‘' can be made into a cover in ery by panidimg any 


uncovered nodes to the set. 


Theorem III./7 


If the nodes 1 and 2" - 2 are in a cover SS, St 


By then the set S°, formed from the nodes in § 
longa with thelr companion nodes in Cee «Pee 
: Shay le 

independent set in Bee and the nodes 93 and 2 - l 


will be covered. 
Eeoo ft : 

By Theorem III.6, S’' 1s an independent set in 
ee) Suniee, moage | 1s in S, if will be in S°>. Thus, 


the node 39 will be covered by node l. Since the node 


ems in S, 2° = 2 EB) 1ts companion node in 
: : oa} 1] ~ n n+l i 
Baty: Will be in 3’. But, 2 Zee 2 a; 
which covers the node gntl - l1insB : 
at+l 
Oe hee & 


2), 


Theorem III.&8 provides another sufficient )Gond Pema 


that nodes in S* will cover the end nodes ore. 


n+l1° 
Theorem III.8 
If the nodes gn-l and gn-l - 1 are in a cover S, 
in Boe then an independent set S', in Bat: can be 


formed from companion pairs such that the nodes 9 


and gntl - l1 will be covered. 
Proof: 
If the nodes 2771 ana 2m-l _ l are in S, tieeeene 


nodes in S to their binary reverses in 8, anc call 


that set R. R will also be a cover iin Boe The 


mapping takes node pas to node 1 and node gn-l - l 
Pon mede v2. jo. Then, by Theorem IE1I.7, we see than 


by adding to the set R its companions in Bo4): we 


form an independent set in which the nodes 9 and 


r 
9" Le l-are coverec. 


Continuing our previous example, the set S* = 
fl, 4, 5, 7, 12, 13, 17, 20, 21, 23, 28, 293° YS" nee oe eee 
in B, because it fails to cover node 31. However, if we 
replace nodes 28 and 29 with node 39, S' would satisfy all 
the requirements of a cover. No more than 2 nodes can 
conflict with a corner node that 1s needed to cover an end 
node. Thus, in exchanging nodes in S’ with a node in Be 


(such as was done in the example), the cardinality of S' 


Wlilwdecrease no More than. 


oO 


Thus, given a cover S, in B., we form an independent 
pete S 5 in Bugz, such that the cardinality of S' is twice 
that of S. This can be done either by mapping the nodes in §S 


to their successors in B r by adding to the nodes in §S 


n+1 © 
their companion nodes in B We can always choose a 
doubling scheme so that at least one of the two end nodes 1s 
covered. If the other end node cannot be covered without 
Creating a conflict, the appropriate corner node can be 
Meougnt into S* to cover that end node and the conflicting 
node(s) deleted from S‘'. In this way, S' can be made into a 
Sever in Bee ee Cardinelity Snemvless Sirhan twice “the 
@erdinality of S minus l. 

For odd values of n, covering sets of approximately 
4/9 of the nodes in the graph are obtained by the sequential 
Fill method. By the doubling theorems, covering sets can be 
found in even order graphs that also have cardinality of 
approximately 4/9 the number of nodes in the graph. As noted 


tore, this 15 guite close to the upper bound of 1/2 the 


nodes in the graph. 


F. POUBLING SCHEMES 

In section V.E, two general methods of doubling were 
discussed. In each method, an independent set S' was formed 
in B., from an independent set S in Bu-1° 5S was then made 


into a cover of B. by adding selected uncovered nodes. 


ol 


to a cover of near maximal cardinality when applied to one 
starter set might not produce as large a cover when applied 
to another set. Likewise, a starter set might produce 
favorable results when doubled by one method but might 
produce poor results when doubled by the other method. 

In this section, two schemes for doubling are presented 
to form a cover in B.. For each scheme, we discuss the 
starter set S, the basic doubling method used and the 
process of covering the remaining nodes. The schemes vary 
for even and odd values of n. These schemes both yleld 
Covers that are close tO ENG —Ugeer secure. yet are 
inexpensive to perform. 

1. Double and Redouble 

a. on even 
Por even values of n, consider the set of nodes 


GOloréed wA- iby “the 32=coloring jim Bel: This set 1S “Wiese 


Lg Veil 
independent in 3B._4,, for it contains the node 2 = 


Thus, we delete that node, making the set indepenceaes This 
1s the starter set S. 

We double S to form S3' by forming the successors 
of the nodes of S. That is, map each node x in S toma 
nedssa2x and 2 Gein B,- Note that there is no modulus 
reduction necessary as the (n-1l)-tuples become n-tuples by 


this doubling. By the doubling theorems in section V.E, we 


know that S' is an independent set in Boe 


reduction necessary as the (n-l)-tuples become n-tuples by 
this doubling. By the doubling theorems in section V.E, we 
know that S' is an independent set in BS 

We take a close look at the nodes in S‘. Since 
the nodes in S are colored A in Bo-j. their binary 
representations all end with an odd number of l'‘s. When 
these nodes are multiplied by 2, we obtain B colored nodes 
in B that end with an odd number of 1's and one 0. 
Moltiplying the nodes in S by 2 and adding 1 yields the C 
colored nodes that end with an even number of 1's. The only 
nodes of these types that are not in S' are the descendents 
of the node 2n7! - 1, which was deleted from S. These are 


the nodes 2" - 2 ana 2" - 1. 


It is noteworthy at this point to mention that 


neither the node @ nor the node 2" - 1 is covered by the 
nodes in S'. Node 9 can only be covered by node 1 or node 
qn-l1. However, node 1 is colored A and is not in S‘. Node 
22-1 is colored B but it ends in (n-l1) 9*s, so it is not in 
S' either. Likewise, node 2” - 1 can only be covered by node 
gn-l - 1 or node 2” - 2. But, node a l is colored A and 
is not in S' and we have already seen that node 2" - 2 is 


m@e in SS‘. 


We cover 2" = 1 by adding 2" - 2 to 5'. We have 


just shown that its predecessors, 2°71 - 1 ana 2" - 1, are 
not in S‘'. We now need to ensure that the successors of 
2™°- 2 are not in S' for S' to remain Lndependent. 


The node 2" - 2 is colored B. Recall that 
companion nodes have the same binary representation except 
in the highest order bit. Thus, unless a node consists of 
all 96's or alli 1's, it will be the same color as its 
companion. So, the companion of 2" = 2 is also colorequsee 
Thus,the successors of 2” - 2 descend from nodes that are 
both colored B. However, the nodes in S' descend from nodes 
colored A. Therefore, the successors of 2" = 2 are not mmeE 
so it 1S not covered and can be added to S’. 

The set S‘' now contains 

1.1 All B colored nodes ending in an odd number of 
1's followed by one 98. 

1.2 All C colored nodes ending in an even number 
Gfen orexcopt. 2 <6 i. 

We consider how well S'‘' covers BA. Certain ine 
the A colored nodes in = that ape less than gn-l are 


covered by the nodes in S' because they are predecessors of 


the nodes in S'. The companion nodes of the A colored nodes 


an-1 n= 


less than are the A colored nodes greater than 2 
These nodes are also predecessors of the nodes in S'‘. Thus, 
all the A colored nodes in Be are covered. 

The nodes in 1.1 are predecessors of thee 
colored nodes that end with an odd number of 1's and two 
O's. The nodes in 1.2 are predecessors of the B colored 


nodes ending with an even number of 1's and one QO. The nodes 


in S‘' also have successors colored A. But, since these nodes 
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have already been covered and no Gonmtluetes exists, the 
details concerning them will be eliminated. 

We see that the nodes that are yet to be covered 
are 

i. All B colored nodes ending in three or more Q's. 
i1. All C colored nodes ending in four or more J's. 
iii. The C colored nodes 2nding in an even number of 
l's followed by two 9's, except for the node 
(24. Biss node is covered by (2*' = 2), which 
Wacmerneluded ino lel. 

Consider now the set of nodes colored A in Bo-3 
not including the node 297-2 -~ 1. This set is doubled by 
taking the successors in Bn-2 and adding the node gn-2 = )2% 
We then multiply each node in the set by 4. £=This 
multiplication will shift the binary representation of these 
nodes 2 places to the left and fill O's in the vacated 
positions. This yields the following nodes in Boe 

2-1 The B colored nodes ending in an odd number of 
l's followed by three 9@'s. 
2.2 The C colored nodes ending in an even number 
of l's followed by two J's, except for the node 
@, = 4)% 
We add these nodes to S'. There is no conflict 


Been che wpodes already in S° because the nodes just added 


are a subset of the nodes that were shown to be uncovered. 
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The nodes in 2.1 are predecessors of the nodes 
colored C that end with an odd number of 1's followed by 
four d's. The nodes in 2.2 are predecessors of the nodes 
colored B that end with an even number of 1's and three Q's. 
Thus, we have the following nodes that are still uncovered. 

i. All B colored nodes ending in five or more Q's. 

11. All C colored nodes ending in six or more 9's. 

1ii. The C colored nodes ending in an even number of 
l‘s and four @'s, except the node (2™ - 24). The 
node (2 - oo) is covered by (es 2°), which was 
added in 2.1 above. 
So, we redouble. 
On the kth doubling, we take the set colored A 


n-2k+1l 


neg B delete the node 2 1, double by taking 


Rhee 2 heels 
the successors of these nodes in Bo-2K42 and add the node 


gn-2kt2 2 to the set. Then, these nodes are multiplied by 


ae shifting their binary representation 2k-2 places to 
the left. The resulting set of nodes is added to S’*. This 
process is done for kK = 1, 2, «-. ) 3am 
If n 1S greater than 2k + 2, we have the 
following nodes that are still uncovered. 
1. All B colored nodes ending in 2k + 1 or more O's. 
ii. All C colored nodes ending in 2k + 2 or more O's. 
L11. The C colored nodes ending inman evens numpe ms 


22k), 


l's and 2k O's, except the node (2" - which 1s 


covered by the node (2 - We = 56 
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Onethe (ny 2)th Boao ae we take the set colored 
A in Bi. {1}, and delete the node Da - l1 = 1, leaving us 
with the empty set. Doubling the empty set still leaves us 
with the empty set. We then add the node 2° - 2 = 2 giving 
us the set {2}. The single node in this set in multiplied by 
22-2 to shift it n-2 places to the left. This gives us the 
node 2"71! to ada to S', which covers the node @. At this 
point, S‘' forms a cover of Bae 

The cardinality of the nodes colored A in oey 
for odd values of n, was given previously as (Gamer 1) /3. 


SO, on the kth doubling, the cardinality of the starter set 


is (gn-2ktl + 1)/3. After deleting one node, we have 
jean ektt - BS: nodes. Doubling this set yields 
(gn-2kt2 - 4)/3 nodes. Finally, by adding one node, 
(qn-2kt2 Swine oe tedes are added to S's To “find the 
cardinality of S', we sum (gn-2kt2 iy Ss  fOssvaluesmof k 
from 1 to n/2, yielding 
io? | = — (gn-2k+2 Se 
k=] 
My 2 
= (> o9n-2kt+2 ne 
k=l 
= {(29t2 _ 4)/3 - n/2}/3 
= ee <Gh=nSn)i/ie. 
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Asymptotically, the ratio of the number of nodes 
in this cover to the number of nodes in the graph is 4/9. 
This is fairly close to the upper bound of 1/2. 

Now that the array of A colored nodes for Ba-l 
is known, the Double and Redouble algorithm can be performed 
in O(V) time, where V is the number of nodes in the graph. 
Given the array of A colored nodes, it costs O(1) time to 
determine any one of the nodes that will ultimately be in 
the cover S' as only a shift 1S required. Since there are 
O(V) nodes in S', the run time of the algorithm isso ge 

Since the nodes colored B are the binary 
complements of the nodes colored A, a similar method can be 
derived’ to -formea. Cover fim B, from the B colored nodes in 
ee ic The method would be basically the same, with only a 
few modifications. Instead of deleting the node gn-2k tT 
on the kKth doubling and later adding the mode gn-2kt- 24 
we would delete the node @ and later add the node ll. Also, 
when we shift the binary representations of the nodes in 
redoubling, we would fill the vacated bits with 1's instead 


of W's. So, instead of multiplying the nodes by gene 


92k-2 2k-2 


Wwe 


would? Imultiolky) oy and add 2 Is 


De, se oad 
For odd values of n, the Double and Redouble 
algorithm is nearly the same as for even values of n. We 
Start with the set collomed 3) taeaen B4-7: This set is 


independent, so we do not have to delete any nodes prior to 


doubling to ensure an independent set S' in Bae There is 
also no need to add a node after doubling. This simplifies 
the process so that we do not have to consider the 
exceptions that were present for n even. 

The doubling method used 1S, again, the method 
of finding the successors in BL ae the nodes in S. This 
gives the following nodes in S'. 

1.1 All B colored nodes ending in an odd number of 
l's followed by a single QO. 
1.2 All C colored nodes ending in an even number 
of ls. 
Here we see that the node ending with (n-1) l's, 
- 1), is in S'. Thus, the node 2" - 1 is covered. 
The following nodes are still uncovered. 
i- All B colored nodes that end with three or more Q's. 
Pi ee colored nodes that end with four or more 0s. 
iii. The nodes colored C that end with an even number of 
l‘s followed by two @'s. 

The A colored nodes in B, are doubled by taking 
Beit SuCcCCeSsOrs in Ba-2° These nodes are then multiplied by 
4 to shift their binary representations 2 places to the 
[te This set is added to Ss’. 

nomen tche case for “even. values of mn, this 
doubling process is repeated. On the kth doubling, the set 
colored A in B-2k+]) iS doubled by taking the successors of 
these nodes in B Then, these nodes are multiplied by 


n-2k+2° 
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22K=< shifting their binary representation 2k-2 places to 


the left. The resulting set of nodes is added to S‘. This 
process is done for k = 1, 2, °% =. (meta) ce 
If n is greater than 2k + 2, the following nodes 
are still uncovered. 
1- All B colored nodes ending in 2k + 1 or more Q's. 
ii. All C colored nodes ending in 2k + 2 or more J's. 
111i. The C colored nodes 2nding in an even number of 
l's and 2k S's. 
For k = (n-1)/2, the set colored A in 35, en). 
is doubled. This yields the set [{2, 3}. These nodes are 


mMuttipl ted {by 22(n-1)/2 52 = on2 to shift their bimang 
representation n-3 places to the left. Thus, the nodes gn-2 
and 3¥Qn-3 are added to S'. However, the node that ends with 


2(n=1)/2 + 1 (=n) 8's (node 0) is stilt t@ncover oa 
We cannot add node 1 to the cover because nodes 


2 and 3 are present. Node gn-l cannot be added to cover node 


n-2 
2 ; 


6 because it is adjacent to the node which has just 


been added. So, we delete the node from the set S Jaa 


was added on the last doubling to cover itself and its even 


n-l 


successor gn-t. We replace oa Wl tee Thus, the node Jd 


Will be covered and no nodes became uncovered. Now S Soa 


cover. 

The cardinality of the nodes colored A when n 1s 
even 1S (9 - 1)/3. So, on the kth doubling, our starter ser 
has (gna2krt Les nodes. Doubling tis yields 
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(2n-2kt2 - 2)/3 nodes that are added to S‘*. Therefore, to 
find the cardinality of the cover, S', we sum (gn-2kt2 -2)/3 


for values of k from 1 to (n-1)/2, yielding 


(n-1)/2 
Is'| = (gn-2k+2 _ 9) /3 
k=1 
(n-1)/2 
aa { on-2k+2 = (ral vic 
k=1 
SPA By ee eee 


Pie 3 = 5) /9 


The cardinality of the cover obtained by 
performing Double and Redouble for odd values of n is the 
same as the cardinality obtained by the Sequential Fill 
algorithm for n odd. These covers contain approximately 4/9 
of the nodes in the graph. 

A similar algorithm can be developed to form a 
cover for n odd that uses the nodes colored B in Bo-1 as a 
starter set. This method would differ from the odd n using A 
colored nodes in the following ways. When shifting the 
binary representation of the nodes in redoubling, the 
vacated bits are filled with 1's instead of J's. Thus, 


instead of multiplying by we would multiply by 


and ada 2“%-4 - 1. also, instead of replacing node ee by 
ay glee Il 


nodes on the last doubling, we would replace node 


Meee ee = ME se on—2Z) 2 Aith node 297? - 1. 
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When the A colored nodes in Be were used as 


=] 
the starter set, the following was observed. Through n = ll, 
after the initial doubling, if we were to sequentially ages 
from the remaining available nodes, the same cover that was 
formed by Double and Redouble would result. This holds true 
for both even and odd values of n. 

When the B colored nodes were used as the 
Starter set, for even values of n, we could produce the same 
cover formed by Double and Redouble by doubling th2 starter 
set and sequentially filling from the remaining availabie 
nodes. To produce the cover formed by Double and Redouble 


for odd values of n, double the starter set and £11) gee 


the remaining av2ilable nodes in reverse sequential order! 


ae Double and Cover 
THis method of Obtaining aseeger in Bi is simi lanpgee 
Double and Redouble in tnat the starter set used is formed 


By o—Color ing. ss We also double that starter set by 


is 
taking the successors of those nodes in Bae However, DOUDme 
and Cover differs in the way that the remaining nodes in the 
graph are covered. 

aie oT) OCG 


For odd values of n, we choose the set of nodes 


colored C in B._, as the starter set S. However, to make S 


an independent set, we must delete the nodes O and gn-l - 1 


ved 


from it. We then double the set S by taking the successors 
of these nodes in B. to form the independent set S'. 
We are now able to add the nodes 1 and 2" - 2 to 
S' to cover the end nodes, without creating a path with any 
of the existing nodes in S‘'. We do so and justify this 
action presently. 
Since the nodes in S were colored C in Buel the 
nodes in S°* are 
1.1 All B colored nodes ending in an odd number of J's 
greater than 1, except the node JO. 
1.2 The B colored nodes ending in an even number of 
1l*s and one @. 
1.3 All A colored nodes ending in an even number of 
1's greater than 1, except the node 2" - 1. 
1.4 The A colored nodes ending 1n an @#ven number of 
O's and one l. 
Let us now justify the addition of the nodes 1 
and 2" - 2 to $' to cover the end nodes. 
The nodes that create a path with node 1 are the 
Medes 2, 3, 9 and gn-1, The last three bits of node 2 are 
meee Thus, it is colored B, but it is in neither 1.1 nor 
1.2. The node 3 ends in @11, so it is colored C and is thus 
fein S . The node @ is in S* only if it is also in S. 


However, we specifically deleted node @ from S. And, the 


node 297! is imo, Only 1£ the node gn-2 toni Mio Sut, LOr 


Yee 


odd values von gn-< 


is colored 8, excluding 1t frome 
Therefore, none of the nodes adjacent to node 1 are in S’. 
The nodes that are adjacent to th= nod] 2) ame 
are the nodes 2°)—- 4.09 —amce gn-l - 1 and 2" = 1. The node 
2" .~ 4 ends in two @'s. Thus it is colored C and 1s smccme” 
S'. The node 2" = 3 ends in 181, so it is colored A, bus 
is in neither 1.3 nor 1.4399 Node gn-l - 1 ends in (n = 1) 


an-1 


ls: Sincesn 1s ocn, - 1 is colored C and 1S not ing 


Finally, the node 2" = 1 is“in"™S”™ only] wena ned qn-t al 


nS 2 vo. PEE, geen - 1 was specifically deleted from 5S. 
Therefore, none of the nodes adjacent to the node 2° — Oe 
iL 9S 

Consider now the nodes in BF that are covered by 
the set S'. The nodes colored C are predecessors of the 


nodes in S’, so they are covered. The nodes in 1.1 abege 
cover all A colored nodes ending in an odd number of O's 
greater than l, followed by one 1. The nodes in 1.2 cover 
all the A colored nodes ending in an even number of l's 
followed by Jl. With these sets covered by nodes in S’ and 
with the nodes in 1.3 and 1.4 included in S‘', we See that 
the only A colored nodes that are not covered are those 
ending in an odd number of 1's followed by Jl. We refer to 
this set of uncovered A nodes as UA. 

Similarly, the nodes in 1.3 cover all B comt@ped 
nodes ending in an odd number of 1's greater than Il, 


followed by one 8. The nodes in 1.4 cover all B colored 
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nodes ending in an even number of O's followed by 19. With 
these sets covered by nodes in S' and with the nodes in 1.1 
ema l.2 included in S’, we see that the only B colored nodes 
that are not covered are those ending in an odd number of 
Q's followed by 18. We call this set of uncovered B nodes_ 
UB. 

Since adjacent nodes in the graph cannot be the 
Same color, we Know that UA and UB are both independent 
sets. We add one of these sets, say UA, to S'. 

Consider now the nodes of the form xx...xJ1Ol, 
feetme cach x Can be a O or a i. This is a subset of UA. The 
(n-4) bits represented by x's can be any combination of U's 
and l's. To find one of the predecessors of the node 
me xG101, we shift the bits one place to the right and 
Fill the vacated bit with a © (divide by 2). This gives us 
O@xx...x019. To find the other predecessor, we shift the bits 
One place to the right and fill the vacated bit with a l 
(divide by 2 and ada 2971). This yields 1xx...x919, which is 
the companion of Oxx...x@10.. Thus, among the predecessors 
of the nodes in UA are the nodes ending in an, odd number of 
G's followed by 10. This is precisely the set UB. So, by 
adding the set UA to S'‘', we cover the set UB. Similarly, 
adding the set UB to S', would cover the set UA. Thus, by 
adding either the set UA or the set UB to S', the cover is 


completed. 


15 


There are two exceptions that should be 
mentioned. If we add the set UA to S' to complete the cover, 
we cannot add the node ending in (n - 2) 1's followed by Ol 
(node 2" ~ 3). This would form a path with the node 27 =m 
2° ~ 2 was added to S' after doubling to cover 2” =m 
Likewise, if we add the set UB to S‘', we must exclude the 
node ending in (n - 2) B's followed by 10 (node 2). Node 2 
forms a path with node 1, which was added after doubling to 
cover the node 93. However, these exceptions leave no node 
uncovered and allow for the end nodes to be covered. 

Because of the structure of nodes in UA and UB, 
we can easily determine the cardinality of the cover just 
formed. The set UA is made up of the nodes ending in 
(2k - 1) 1's followed by 931, where k = 1,2,...,(n=3)0ee 
k = (n-1)/2 is disallowed because of the conflict with the 
node 2" - 2. Since the bit prior to the (2k - 1) long Stiaime 
of l's must be a G8, there are (n = 2k = 2) bits thai 


free to be either a 9 or a ls Thus, there ane gn-2k-2 nodes 


ending in (2k - 1) 1's followed by G1. To find aipae 
cardinality of UA, we sum gn-2k-2 for Kk = 1,2,...,(n-2 0 
(n-3)/2 
[onl =. By ae 
k=1 


= (H(2"° “Seah a 


(2054 23572. 


[UA | 


ES 


Since the nodes in UB are the binary complements of the 
nodes in UA, |UB] = |UAI. 


The cardinality of the set of C colored nodes in 


Bo4-1 is (poe + 2)/3 for odd values of n. Deleting nodes 9 
and 2"-! jeaves (nl - 4)/3 nodes in the starter set S. 
Doubling S and adding the nodes 1 and 2" - 2 results in 


(2° - 2)/3 nodes in S'. Completing the cover by adding the 


set UA to S‘ yields 


lis] 


Gis= We. sent" > =9)7/3 


(5*2"-2 _ 4)/3. 


The ratio of the number of nodes in the cover to 
the number of nodes in the graph is approximately 5/12. 
Pimele this is fairly close to the upper bound of 1/2, it is 
not quite as close as 4/9, obtained by the Sequential Fill 
algorithm for odd values of n and the Double and Redouble 
algorithm for odd values of n. 

The Double and Cover algorithm requires O(V) 
time, where V is the number of nodes in the graph, since the 
set of C colored nodes is Known. The doubling process 
reguires 1 multiplication and 1 addition for every node in 
the starter set S. While the filling process requires 1 


addition for every node added to S'. 


77 


b. on even 
For even n, the Double and Cover method works 
very much the same as it does for odd n. The starter set 5S 
is the set of C colored nodes in B._,. This set is 
independent, so there is no need to delete any nodes prior 
to doubling. We again double the set S by taking the 
successors of those nodes in B, to form the independent set 
S'. This gives the following nodes in S’. 
1.1 <All B colored nodes ending in an odd number of aie 
greater than l. 
1.2 The B colored nodes ending in an even number yea 
is and sone... 
1.3 All A colored nodes ending in an odd number of ls 
greater than l. 
1.4 The A colored nodes ending in an even number of 
OS" a tiers One mis. 
Since n is even, the node nes ends in an odd 
number of J's and is included in 1.1. Thus, the node Gai 


gn-l 1 ends in an odd number 


covered. Likewise, the node 
of I"s and is included in 1.3, cOvenmiInGgwenemnede 2 ae 

As was the case for odd n, we can partition the 
remaining uncovered nodes in B into two independent set 
that we will again call UA and UB. In the set UA, are the A 
colored nodes that end in an odd number of 1's followed by 


91. The set UB consists of the B colored nodes that enduiie 


an odd number of O's followed by 10. Because we did not add 
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any nodes to S’ to cover the end nodes after doubling, there 
are no exceptions as there were for odd values of n. AIlso, 


the set UA is covered by the set UB and vice versa. Thus, we 


at 


can complete the cover by adding either set to S 


n-2k-2 


Again, there are 2 nodes that end with 


(2k-1) l's followed by O61. For even values of n, k ranges 
from 1 to (n-2)/2. Thus, the cardinality of UA is given by: 


(n-2)/2 : 
lua | gn-2k~2 


k=l 
Ge col) (4) 
= (2972 _ 4)/3 


The nodes in the set UB are the binary complements of the 
Medes in the set UA, so |VAI| = |UBI. 

BOr even values of nm, the cardinality of the set 
of C colored nodes in Sia is (2n-t = 2))/ SeMbOUDI Ano otis to 
menm S', we have |S'| = Grad - 4)/3. A@ding either the set UA 
or the set US to S‘' to complete the cov2r, we obtain the 
following expression for the cardinality of S’. 

ae (2° = a3. Se 2 1/3 


(soo wees 


Thus, for even values onion, the fraction of nodes in 3, that 
“memim the cover is 5/12. 
It was observed for poth odd and even ordered 


graphs through B,,, that if we complete the cover by 
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sequentially filling from the remaining uncovered nodes 
rather than adding either the set UA or the set UB, we 
obtain a cover whose cardinality is equal to the one formed 
by Double and Redouble. Unfortunately, sequential filling is 
an expensive technique if only a portion of the cover is 
formed in that way. It is only when the entire cover is 
formed by sequential filling that the cover can ve formed 
efficiently. 

In the next section, we present a general method 
to increase the cardinality of a given cover under certain 
conditions. If the increase in cardinality is not £00 gQuenuam 
which is the case in Double and Cover, the expense of 


building up the cardinality is tolerable. 


G. BUILD-UP 

When determining the lower bound for the cardinality of 
a covering set S, in section II.E, each node in S$ was sovete 
responsible for covering four distinct nodes in BY - Sa 
that case, if any given node was removed from S, then the 
Four nodes adjacent to it could be brought in, increasing 
the cardinality of S by tnree. 


In most covers, a 4-for-l exchange will not be possible. 


However, if a node x, in S, is solely responsible for 
covering k nodes in By - S, a k-for-l exchange can be made. 
This increases the cardinality of S by k - l. The new set 


will still be a cover. The node x is now covered by each of 
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the k nodes brought inta S. Any nodes previously covered by 
x that are not brought into S are covered by other nodes in 
S, otherwise they would have been part of the exchange. 

If no k-for-1l exchanges in which k > 1 are possible, 
then only l1-for-l exchanges can be made. While making such 
an exchange has no immediate effect on the cardinality of 
the cover, it may make it possible to increase the 
cardinality by future exchanges. 

Sracbting With a cover S in Boe we repeatedly make 
k-for-l exchanges where k 1s as large as possible. In this 
way, the cardinality of the covering set can be increased. 
We continue this process until either all of the nodes in By 
- S (except 9 and 2" - 1) are covered by more than one node 
am S aa a cycle of l-for-1 exchanges occurs. 

As an example of this process, consider S$ = {1, 4, ll, 
14}, a covering set of Bae The node 4 is the only node 
covering nodes 9 and 10. If 4 is deleted from S, then 9 and 
19 can be brought into S, resulting in S‘' = {1,9,190,11,14}. 

At thiS point, we can do no better than the Il-for-l 
exchange of removing node 1 from S°* and adding node 8, 
Weeving us with S'' = {8, 9, 19, 11, 14}. 

From this set, if node 9 is deleted, nodes 2 and 3 can 
memeadded, yielding S‘'’' = {2, 3, 8, 160, 11, 14}. With this 
cover, each node except 9 and 15 is covered by more than one 
node, and the algorithm stops. The cardinality of the 


resulting cover is 6. This is the maximum cardinality found 
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in the exhaustive search for n = 4, so the Bui le=we 
procedure yields a covering set of maximum cardinality for 
n= 4. 

There are two situations which can arise that will cause 
the Build-up algorithm to fail to work properly. Wha 
neither is likely to occur, they are both real possibilities 
and must therefore be guarded against. 

The first Situation arises when the node to be removed 
from S lies on a 3-cycle, such as (4,9,18) in Be. If one of 
the nodes, say node 4, 1s in S and none of the other nodes 
adjacent to nodes 9 and 18 are in S, then only node 4 covers 
nodes 9 and 18. The Build-up routine would attempt to 
increase the cardinality of S by deleting node 4 and adding 
nodes 9 and 18. This, however, would result ina set that 1s 
no longer independent, since a path exists from 9 to 18. 
Because of the relationships of the nodes adjacent to the 
nodes on a 3ecycle, the situation described above is 
impossible for n < 5 and seems unlikely to occur for larger 
n. Also, as there are only two 3-cycles, (0981) and (0811), in 
the de Bruljn graph for each n > lil, the likelihood of @ens> 
Situatlon occurring does not increase as n gets larger. 

The second difficulty to be avoided occurs as a Natta 
process of the Build-up algorithm. As nodes are removed and 
subsequent nodes are added to a cover, Build-up tendsiiee 
pack nodes in the cover more closely together. The process 


does this because then, more nodes can be added to the set. 
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However, in this shifting around process, there is nothing 
to prevent the nodes that cover the end nodes, 9 and 2” -1, 
from being deleted and being replaced with nodes that do not 
cover the ends. If this happens, the resulting set will not 
be a cover. One solution to this problem is to peg down the 
nodes that cover @ and 2" - 1 and never allow them to be 
removed from the set. 

The process of finding the nodes in S, if any, that are 
solely responsible for covering nodes in Bh - S, is the 
@eminant factor in the cost analysis of Build-up. For each 
node x, in S, the nodes adjacent to x must be checked to see 
if they are adjacent to any other nodes in S. Since there 
are O(V) nodes in B. - S$ to be checked against O(V) nodes in 
S, this process requires O(Vv7) time. Thus, each iteration 
of the procedure costs O(Vv7) time. 

This can get quite expensive 1f we Build-up the 
cardinality of a given cover by a large amount. As _ an 
example, consider the cover obtained py 3-coloring as far as 
possible. It is possible to increase the cardinality of this 
cover from about (1/3)V to about (4/9)V. Thus, we perform 
the Build-Up routine at least (1/9)V times. In this case, 
the entire Build-up process is at least o(v?) in cost. 

The Bulld-up routine was used in conjunction with the 
results from previous algorithms in an attempt to increase 


the cardinality of those covers. Tables III.2-6 show the 


results when the Build-up routine is applied to the results 


a3 


of Frugal, Sequential Fill, 3-coloring, Double and Redouble, 
and Double and Cover respectively. 

An interesting note is that neither the results from the 
Sequential Fill algorithm for odd values of nn non w=eme 
results from the Double and Redouble algorithm could be 
bullt-up to a cover of higher cardinality. However, Gime 
does not imply that the ultimate cover obtained by these 
algorithms is the largest obtainable for that value of n. 
Covering sets found by other algorithms build-up to greater 
cardinalities for the same values of n. For instance, both 
the Sequential Fill algorithm and the Double and Redouble 
algorithm for n = 11l yielded a cover of cardinality 9906. The 
Frugal algorithm, on the other hand, yielded a cover o£ 
cardinality 917 for n = 11 and was further built-up to 9gzze 

It is also interesting to note that the Build-up routine 
increased the cardinality of the cover obtained by the 
Double and Cover method to that of the cardinality Of @ieae 
Double and Redouble method. 

It 1s possible that a k-for-} exchange of nodes between 
S and B| - S, where k > j > 1, could result in a cover OF 
higher cardinality for the Sequential Fili sets for n odd or 
the Double and Redouble sets for any n. The same is true for 
the largest cover obtained by Build-up for any n. However, 
no program which would accomplish such an exchange has been 
written for this problem. We leave this for future 


researchers to attempt. 
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TAS 2 tier 


BUILD-UP APPLIED TO FRUGAL ALGORITHM 


Upper Frugal & 
n Bound Prugalmeoud )d—up 
1 % g ) 
Z L } 1 
3 2 2 2 
4 6 6 6 
5 rz 12 12 
6 S72 25 2,5 
7 64 ae 54 
8 2 3 88 le 
2 250 IN ERDAS 
16 512 S79 452 
Pel 1624 Sil fi way 


TASB eer. 3 


BUILD-UP APPLIED TO SEQUENTIAL FILL ALGORITHM 


Upper Sequential Seq. Fill 


n Bound Baby & Build-up 
iL B QB %5) 
2 L 1 1 
3 2 2 2 
4 © 5 6 
5 eZ 12 12 
6 OZ Pal 27 
7 64 54 54 
8 MA} 85 de: 
9 2510 224 224 
10 Dee 341 461 
El 1924 956 956 


oS 


TABLE J iiiwe 


BUILD-UP APPLIED TO 3-COLOR ALGORITHM 


Upper 3-Color & 
n Bound 3=-Color Build-up 
i 4) 4) 0) 
2 it i 1 
3 2 2 2 
4 6 5 6 
= eZ 1 eZ 
6 a2 7as\ 2, 
ii 64 42 54 
8 T2323 85 al 2 
2 256 179 224 
19 Sil 341 453 
iba 1624 Gaz cael 


TABLE III-<> 


BUILD-UP APPLIED TO DOUBLE & REBDOUBEE ALGORTiin 


Upper Double & Double & Redoudle 


n Bound Redouble <) Bua d= 
i U 0 G 
Zz 1 iL IL 
3 2 2 2 
+ 6 6 6 
5 8m WH 1 
6 52 2 27} 
y, 64 54 54 
8 128 PipZ Tai 
9 256 224 224 
16 DileZ 453 453 
aul 1924 986 906 
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TABLE III.6 


BUILD-UP APPLIED TO DOUBLE & COVER ALGORITHM 


Upper Double & Double & Cover 
n Bound Cover & Build-up 
I! 4) 0 g 
2 1 4) | 
3 2 2 2 
4 6 5 6 
5 h2 2 i 
6 62 25 27 
7 64 > 2 54 
8° We ig 112 
9 256 2in2 224 
1Y pli 2 425 453 
IT 1824 852 906 


H. CONCLUSIONS ON MAXIMAL COVERINGS 

In comparing the algorithms developed to form a maximal 
or Fear=maximal cover, we look first at the cardinalities of 
the sets produced by the various algorithms. Table III.7 
favo eae for nm < 10, the Double and Redouble algorithm 
yields the largest covering set produced by any of the basic 
megorithms. It was shown in section III.F.1 that the ratio 
of the number of nodes in the cover to the number of nodes 
in the graph tends to 4/9 for the cover formed by Double and 
Redouble. This is very near the upper bound of 1/2. For odd 
n, the performance of Double and Redouble is equaled by 
Sequential Fill. 

The performance of the Frugal algorithm is-~ also 
Meteworthy. While the Frugal algorithm performs only 


moderately well for even values of n, it performs quite well 


87 


TABLE II1.7 


RESULTS OF BASIC MAXIMAL ALGORITHMS 


Upper Seq. Double & Double & 
n Bound Frugal |p a ie 3-Color Redouble Cover 
il 4) 4) g ) 4) 4) 
Z 1 il iF 1 1 a) 
3 Ze 2 2 2 2 2 
4 6 6 5 5 6 5 
5 12 12 12 10 12 12 
6 a2 25 21 DMN Z7 25 
7 64 53 54 42 54 a2 
Se Ze 88 25 85 el 2 NB 31S 
») 256 27 224 172 224 21n2 
13 S12 373 341 341 453 425 
ll 19824 oe) 996 682 936 So) 
for n odd. For n = 3, 5, /, & Othe cover .oreeuceamaas 


Frugal is nearly as large as the cover produced by Double 
and Redouble. For n = ll, the Frugal algorithm produces 
cover of greater cardinality than the one produced by Double 
and Redouble. 

By applying the Build-up routine to the covers formed by 
the basic algorithms, we are able to increase these covers 
in cardinality. The exceptions are for Double and Redouble 
and Sequential Fill for n odd. In Table I11.8, we Sea w@ene 
results Of buildinog-us each cover jas much as the routine 
would take it. In each case, we form a cover that 1S very 
close in cardinality to the cover formed by Double and 
Redouble. For n > 8, we are able to Build-up covers tCigaaie 
actually larger than the ones produced by Deublewane 
Redouble. Wowever, they are larger by only a small amount. 


This leads us to believe that while Double and Redouble 
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TABLE lliers 


By Leb SUP ne eee TOSTHE BASIC = MAxIMAL ALGORITHMS 


Upper seq. Double & Deuble & 

n Bound Frugal Bot SiC 6 ors Redouble Cover 
1 4) 4) 4) 4) a) J 
2 iL i if il: 1 4) 
3 2 2 2 2 2 2 
4 6 6 6 6 6 6 
5 2 Zz e2 in2 dez eZ 
6 32 2 | 27 oy 27 
7 64 54 54 54 54 54 
8 P28 steed es pez i ale 
© 220 226 224 224 224 224 
1d 2) 10 452 461 453 453 453 
ll 169624 oi) 986 oe 9B O26 


Mees not produce a cover of maximum cardinality for all n, 
mae upper bound on the cardinality of a cover of Bn is 
Mmieser to 4/9 than it is to 1/2. 

Cems veturm Our attention to the cost of producing 
these covers. In Table III.9, we see that all of the basic 
algorithms, except Frugal, cost O(V) time to perform. Frugal 
costs O(v7) time. However, to increase the cardinality of 
some of these covers, the Build-up routine was applied. Each 
application of the Build-up routine costs o(v4) time. cis 
Setves Up the cost of producing these covers to at least 
O(v-). The covers produced by Frugal, for n even, 3-Color 
eae oegquential Fill, for n even can be built-up O(V) times. 
When this is done, the overall cost of producing the cover 


goes up to O(v?). 
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TABLE 12229 


COST COMPARISON OF MAXIMAL ALGORITHMS 


Algorithm Cost Cost to Build-Up 
jee bie faVh 

a. on Even O(v2) Ov) 

b. n Odd o(v7) o(v*) 
Sequential Fill 3 

Aue. ne ewen O(V) O(V~) 

bs. “nee@dd O(V) ---- 
3-Color Oly) O(v?) 
Double & Redouble O(V) ---- 
Double & Cover O(V) O(V-) 


Thus, considering both the cardinality of the cowaE 
obtained and the cost of producing that cover, the Douoie 
and Redouble algorithm is the method of choice for producing 
near-maximal covers. Om Odd saa, the Sequential ~~ Fam 


algorithm is equally desirable. 
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IV. MINIMAL COVERINGS 


A. INTRODUCTION 

mye = JGOalmOn this chapter 1s to find a cover of B, whose 
cardinality approaches the lower bound established in 
section II.D. We first attempt to form such covers by 
implementing a greedy method. We then present a method of 


Seecitioning the nodes in BO that results in a cover whose 


cardinality is very near the lower bound. 


Be GREEDY 

The solution to a wide variety of graph theoretic 
problems is found by processing the graph in ae greedy 
fashion. For our particular problem of finding a covering 
set of minimum cardinality in Be a greedy-type algorithm 
seems to be a very logical way to proceed. In forming the 
Bevering set S, we include nodes in S that cover the most, 
as of yet, uncovered nodes. The EOLVOVINnG® “algorithm 
implements this greedy method. 

pligorithm: Greedy 


Seco. celect nodes! and Saeher node D2 PE) ei 
node 2n7l l1 to be in the set. (This is done to 
ensure that the resulting set will be a cover.) 

Step 2. From the remaining uncovered nodes, add 
the node with the most uncovered neighbors. In case 


of a tie, add the smallest such node. 


Or 


Step 3. If all nodes are covered, the set is a 


cover, stop. If mot, Go tomstecpez. 


This algorithm is very similar to the Frugal algoriimag 
presented in section II.B. In the Frugal algorithm, we chose 
the node with the fewest uncovered neighbors, while the 
Greedy algorithm chooses the nodes with the most uncovered 
neighbors. 

As in Frugal, we reduce the number of combinations to 
cover the end nodes by forcing the node 1 to always be in 


the cover. Once node 1 is in the cover, we have no choice to 


make on how we cover the node 2" = 1 until n = 4. For n = JL 
no cover exists. For n = 2, the node 1 forms a cover by 
itself. For n = 3, the node 3 is a descendent of node Tee 


node 6 1s included in the S$ to cover node 7. 

For n > 4, there is a choice on how to cover Ciouiaemm 
2. - 1. However, the only case for 4 < n < 11 for whichis 
cardinality of the covering set is affected by the choice 1s 
n = 6. When 2° - 2 = 62 is included in S$, the cardinality 
of S is 18. When 2°? - 1 = 31 is chosen to cover node 63, the 
Cardtinality “of Jo. 1s. lo 

The cost of performing Greedy 1s the same as that of 
Frugal,. 1.e- OCW Vx Thus, we are able to use this algorithm 
to find covers of the de Bruijn graph for several values of 
Dee 


The results of Greedy for n < 12 are seen in Table 1 /aae 


The lower bound for n < 6 has been adjusted bas=a ener 


NZ 


results of chapter MiweeNotewcthatemthne  Gardinality of the 


cover formed by the Greedy algorithm is greater than the 


lower bound for n = 5. Since the lower bound was achieved 
for n = 5 by exhaustive search, Greedy is not a minimal 
eergorithm for n = 5. 


TABLE IV.1 GREEDY ALGORITHM 


Lower 

n Bound Greedy 
ii 4) Q - 
Z sf 1 
3 2 2 
+ 4 4 
5 8 S 
6 i 16 
7 20 on, 
3 De 78 
2 PS lore 
16 235 Sao 
lk 4190 S05 


ee QUARTERING 
We form a cover S in B_ by partitioning the nodes in BL 


mreo 4 blocks of equal size. 


Brock 1. The nodes from 0 to 2974 - 1, 
Elock 2. The nodes from 2°74 to 2971t - 1, 
Block 3. The nodes from 27! to 3*2M74% - 1, 
Block 4. The nodes from 3*2"72 or 2 = 1 


Place in the set S the nodes in block 1 whose binary 
representation ends in @l, the nodes in block 2 whose binary 
representation ends in 60, the nodes in block 3 whose binary 
representation ends in 1l, and the nodes in block 4 whose 


binary representation ends in 10. Since the number of nodes 


ere 


selected from each block is 2°74, there are a total of 27? 
nodes in S, or 1/4 of the nodes in th graph. We will refer 
to the nodes in S fEroem block) aeae S;, i. = 10a 

In showing that S is an independent set, it suffices to 
show that the successors of the nodes in S are not also in 
S. Since the nodes in block 1 are between 9 and gn-2 - il, 
their successors lie between J and gn-l - l, i.e. blocksu 
and 2. The successors of a node ending in Yl end in either 
POeor Ji Tacs ene nnodes win S; have successors in Dblioc@eam® 
and 2 that end in either 10 or 11. However, the nodes taken 
From block 1 in S end in Ol, while the nodes from bloc ry Zaae 
S end in 96. So, the successors of the nodes in Sy are not 
Ehia So 

Ge successors of the nodes in block 2 lie between 227} 
Bvalle Oe ee ER. 1ge8w, blocks -3 vand. 34. A node ending in 90 has 
Successors that end in either 989 or 91. Thus, the nodes in 
S, have successors in the blocks 3 and 4 that end in ges mim 
bm 00 “er Oi, But, the nodes. in S 3 end in 11 and the nodes in 
Sy end in 10. Therefore, the successors of the nodes in S5 
areuwnoec lito. 
The successors of the nodes in block 3 lie between O and 

= tlie tee.) DOCKS. «1 cana. The successors of a node 

ending in 11 ends in either 1@ or li. Thus, the nodesm@ia S 5 


have successors in the blocks 1 and 2 that end in #ither dn 


LOO tte l i However, the nodes in Sy end in 91 and the nodes 
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178 So end in 9@. So, the successors of the nodes in S, are 
net in S. 

Finally, the successors of the nodes in block 4 lie 
between 227! ang 2" - 1, i.e. blocks 3 and 4. The successors 
of a node ending in 16 end in either 89 or 61. Thus, the 
nodes in Sy have successors in the blocks 3 and 4 that end 
in either in 96 or @1. But, the nodes in S3 end in ll and 
the nodes in Sy Sra Ore SOs Ene ‘successors sof the nodes 
in Sy are not in S. None of the successors of the nodes in S 
are also in S. Therefore, S$ 1S an independent set. 

AS stated earlier, the successors of the nodes in Sy lie 
im mioeks l and 2. Because the nodes in Ss) end in @l, the 
last 3 bits of their successor nodes are 910 for the even 
successors and @ll Lon the odd successors. ine WOulr 
Mamelctilons, a node and its companion cannot both be in the 
same block. Therefore, there are gn-4 distinct nodes ending 
in 919 and 25-4 gdistinct nodes ending in 911 that are 
successors of the nodes in Sy. Since this is the number of 
nodes in blocks 1 and 2 that end in this manner, all the 
nodes in blocks 1 and 2 ending in 919 or 011 are covered. 

The successors of the nodes in S also lie in blocxs 1 
and 2. They are the ne nodes that end in 119 and the aaa 
nodes that end in 111. Thus, the nodes in S), together with 
mae nodes in S3, cover all the nodes in blocks 1 and 2 that 


mmaein 16 or ll. 
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In a similar manner, the nodes in S5, together with the 
nodes in S,, cover the nodes in blocks 3 and 4 that end in 


99 or Ol. 


We have yet to snow that the following nodes are 


covered. 
A. The nodes in block 1 that end in 989. 
B. The nodes in block 2 that end in @l. 
CG The nodes in block 3 that end in 198. 
D. The nodes in block 4 that end in ll. 


The nodes in the first half of A (the nodes ending in 9 
that lie between 9 and aie 1) have successors that lie in 
block 1. The even successors end in 968 and the odd 
Successors end in 91. Thus, the nodes in the first half of A 
are covered by the nodes in Sy. The nodes in the second nalf 


gn-3 ane 


of A (the nodes ending in 36 that lie between 
gn-2 oo 1) have successors that lie in block 2. The even 
Successors end in 00 and the odd successors end in Gl. Thus, 
the nodes in the second nalf of A are covered by the nodes 
in Sj. Similarly, the nodes in B are covered by Eleame 
Successors that are in S; and S,, the nodes in C are covered 
by their successors that are in Ss} and S5, jnd the nodes jem 
D are covered by their successors in S and Sy). 

Thus, the set S, formed by this quartering scheme) 
covers B.. The cardinality et Sesseul oo 1/4 of the 
nodes in B., which is very close to the lower bound of 


Paar It 1S noteworthy that the Quartering scheme produced 


96 


a cover equal in size to the minimal covers found by the 
exhaustive search for n < 6. 

In determining the nodes to be placed in S, it is only 
necessary that we check the contents of the last two bits in 
the binary expansion of the nodes in BA: This can be done in 
O(V) time. Thus, we have an inexpensive method to form a 


cover whose cardinality is very near the lower bound. 


D. CONCLUSIONS ON MINIMAL COVERINGS 

Of the two methods presented to produce minimal or near- 
minimal covers, clearly the Quartering scheme is the method 
of choice. In Table IV.2, we see that for n > 6, the 
cardinality of the cover produced by Quartering is 
considerably less than the cardinality of the cover produced 
by Greedy. And the cost to perform Quartering is only O(V), 
while the cost to perform Greedy is O(Vv*). While the Greedy 
method seemed to be a very natural way to proceed, 


Quartering yields much better results. 


TABLE IV.2 RESULTS OF MINIMAL ALGORITHMS 


Lower 
n Bound Greedy Quartering 
iL 4) Y) 4 
2 I 1 1 
3 2 2 2 
4 4 4 4 
5 8 5) 8 
6 ne 16 16 
fi 26 ao a2 
8 Be 78 64 
S C3 158 128 
19 285 313 256 
La 416 65 lig 


V. FURTHER LUD TES 


In exploring the bounds on the cardinality of covering 
sets of the de Bruijn graph, many questions remain 
unanswered. 

In investigating maximal covers, the parity of n played 
a much greater role than expected. The effect that the 
parity of n has on Sequential Fill, Double and Redoubie, jam 
Double and Cover is understood to a degree. However, the 
difference in the performance of the Frugal alcgosimeiga 
between even and oddn is, as of yet, unexplained. 

Frugal finds the node in BL that is adjacent to the 
fewest, as of yet, uncovered nodes and adds this node to the 
covering set. In the event of a tie, tne smallest such node 
1s taken. Perhaps, other tie~breaking schemes might improve 
the performance of Frugal, especially for n even. A Monte 
Carlo scneme that randomly chooses from among the tieing 
nodes might be implemented. 

In the Greedy algorithm, the node in BS that is 
adjacent to the most, as of yet, uncovered nodes is added to 
the cover. Ties are also decided in Greedy by choosing the 
smallest node involved in the tie. The performance of Greedy 
might also be improved by employing different tie-breaking 


schemes. 


oye! 


Another area for further research is the Build-up 
routine. Presently, the Build-up routine makes a k-for-l 
exchange between the nodes in BL - S and S in an attempt to 
increase the cardinality of the cover S. An algorithm that 
makes a K-fOn—=j exchange, where K > j > 1 mene be able to 
further increase the cardinality of the cover to a level 
higher than the present Build-up routine does. 

A process that might yield insight to both the upper 
and lower bounds, but has not been studied as of yet, is 
Random Fill. In this process, nodes are chosen at random and 
muceemlaced In @ COver if no conflict exists with any 
previously chosen nedes. Perhaps, by performing Random Fill 
many times for a given value of n, more can be learned about 
the extremes of the range of values that the cardinality of 
eeeover can take on. We can also apply the Build-up OU erm 
to the results of Random Fill. Since the Build-up routine 
produces slightly different results for the different covers 
that were used as input, Random Fill might produce a cover 
mmc will build to a higher cardinality than previously 
Setarned for that n. 

iieOowseresearen, the lower bound was not explored yuite 
Seeeeully as the upper bound. This is due, in part, to the 
fact that a Build-down routine was never developed. Such a 
routine would take a covering set S and from it form a cover 
Seeeor lesser cardinality. With this tool, we could further 


probe the lower bound. 
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However, it is felt that the lower bound on the 


n=-2 nodes, or 1/4 of 


Garadinallty of a "cev ene. BO should be 2 
the nodes in the graph. 

As noted in section I.C, the nodes in BL have a l-to-l 
correspondence to the arcs in oe Consider the conditions 
that are imposed on a set of arcs in B4-) that correspond to 
a covering set nodes in BAY In these conditions, there seems 
to be a relationship between a minimal covering set of nodes 
in B, and the arcs in B__, that deal with a graph theoretic 
concept known as maximal matching. It is not difficulUies 
prove that nodes in B, that form the Quartering cover 
correspond to arcs in B _, that form a maximal matching. 
However, as of yet, a proof has not been made which shows 
that the number of nodes in a cover in BL can be no less 
than the number of arcs that make a maximal matcning in 

n-l 

Certainly, a suitable topic for further research is to 

Drove “or  disoreove thsat.a (cover -17 Be cannot be formed with 


on-2 


fewer than nodes. 
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